Solved – Can component scores be used for further analyses, e.g. cluster analysis

clusteringfactor analysisk-meanspca

I have done a principal component analysis using SPSS and now have 3 components.
2 components have 4 items in the subscale, and 1 component has 3 items. Component scores using regression for each factor were added as variables to the data set by SPSS.

My question is: can the component scores (the new variables that SPSS added to the data set) be used as the variables for K-means cluster analysis to form clusters? My aim is to find out the exact number of cases in each factor, which I intend to do through cluster analysis. Is this the statistically correct method?
Thank you for any input.

Best Answer

My impression is that yes, you can certainly do a cluster analysis using the component scores. If you are interested in clusters of individuals based on these component scores (can we say latent factors?) then doing the analysis using the component scores makes complete sense. Lets say you are measuring factors of personality, you would likely be far more interested in clusters determined using the component scores (i.e., measuring personality) than you would on individual items.

This, however, might not be the case if you are trying to do some sort of latent cluster analysis. In this case, you may want to use individual items to estimate the latent factors in the analysis.