Factor Analysis – Conducting It on Multiply Imputed Data with Procrustes Analysis

factor analysismissing datamultiple-imputationprocrustes-analysis

I have a data set with approximately 500 observations on eight key variables. There are a lot of missing data; only about 1/12 of the observations are complete. I am using PROC MI and MIANALYZE in SAS to run various regressions on multiply imputed data, and this is working well. (There are about 200 variables in total, and there are high correlations among them which helps multiple imputation.)

However, I would also like to do factor analysis on the imputed data. This does not appear to be easily done in SAS, and it poses some interesting challenges:

  1. The signs of the factors are arbitrary, so different imputations could yield opposite signs;
  2. What was factor 1 in one imputation could be factor 2 in another imputation (although in this case, with so few key variables, it is likely that one factor is enough),

and probably other issues as well.

I could do exploratory factor analysis on each imputed data set, of course, and I could then average them on a sort of ad-hoc basis, but this seems very sloppy.

Some Googling did not reveal any solutions, but … well, Google doesn't always find everything.

Any help, ideas, references appreciated. I have access to SAS and to R.

Best Answer

Factor analysis or principal components analysis may indeed yield solutions whose answers are rotated or mirrored versions of each other, so averaging the person scores is not a good idea. Several authors have explored the use of Procrustes analysis to correct for the rotational indetermination, so try searching on 'multiple imputation' and 'procrustes'.

Related Question