Solved – How to handle variables with low correlation but high loadings in factor analysis

factor analysis

I am doing factor analysis to check the factorial validity of a 14-items scale with four subscales. Two items have low (less than 0.3) correlations with other items in the subscale to which they belong. However, they load considerably (greater than 0.32) on the factor (latent variable) to which they correspond. Under such condition, should the two variables be dropped from the scale on grounds of low correlations or be retained within the scale on grounds of the acceptable loadings?


Update: After going further through the analysis–removing one item on substantive ground and another item due to low loading (in the range of .2)–I found the correlation matrix depicted below.

enter image description here

The highlighted correlations indicate the correlations of items with other items in the same subscale. However, some items also correlate with items in other subscales considerably (>0.3). Yet, the four factors themselves are inter-correlated which I think is the reason why items also show non-negligible correlations with other items of a different factor. Below is the inter-correlation of factors.

enter image description here

However, as you see hereunder, the loading of each item on the factor to which it belongs is substantial and no cross-loading.

enter image description here

Given these results, how should I go about interpreting my subscales? May I be justified to consider the sub-scales as being suited to measure the underlying constructs/factors of interest?

Best Answer

Do these two correlate highly with each other? If you add another factor to the solution, do they form a separate factor?

If so, then they may be another factor (but with only two variables that relate to it) and this factor may relate to the factor you discuss in your post.

Try different numbers of factors and try oblique rotations as well as orthogonal ones in order to further explore what is going on.

In the end, though, I think the determination should be made based on which solution makes substantive sense.

Related Question