Factor analysis is essentially a (constrained) linear regression model. In this model, each analyzed variable is the dependent variable, the common factors are the IVs, and the implied unique factor serves as the error term. (The constant term is set to zero because of the centering or standardizing that is implied in the computation of covariances or correlations.) So, exactly as in linear regression, there can be a "strong" assumption of normality - the IVs (common factors) are multivariate normal and the errors (unique factors) are normal, which automatically implies that the DV is normal - and a "weak" assumption of normality - only the errors (unique factors) are normal, so the DV need not be normal. Both in regression and in FA we usually adopt the "weak" assumption because it is more realistic.
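To make the regression analogy concrete, the common factor model for a (centered) variable $X_j$ can be written as

$$X_j = \lambda_{j1} F_1 + \lambda_{j2} F_2 + \dots + \lambda_{jm} F_m + U_j,$$

where the common factors $F_1, \dots, F_m$ play the role of the IVs, the loadings $\lambda_{jk}$ are the regression coefficients, and the unique factor $U_j$ is the error term (the notation here is mine, chosen for illustration).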
Among the classic FA extraction methods, only the maximum likelihood method - because it makes inferences about population characteristics - requires that the analyzed variables be multivariate normal. Methods like principal axis or minimal residuals do not require this "strong" assumption (although you can make it anyway).
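If you happen to work in Python rather than SPSS, here is a minimal sketch contrasting the two kinds of extraction; it assumes the third-party factor_analyzer package, and the toy data are my own fabrication:

```python
import numpy as np
import pandas as pd
from factor_analyzer import FactorAnalyzer

# Fabricate 200 observations on 6 items driven by 2 common factors
rng = np.random.default_rng(0)
factors = rng.normal(size=(200, 2))
loadings = rng.uniform(0.4, 0.8, size=(2, 6))
X = pd.DataFrame(factors @ loadings + rng.normal(scale=0.5, size=(200, 6)),
                 columns=[f"v{i}" for i in range(1, 7)])

# Minimal residuals: no multivariate normality required
fa_minres = FactorAnalyzer(n_factors=2, method="minres", rotation=None)
fa_minres.fit(X)

# Maximum likelihood: assumes the variables are multivariate normal
fa_ml = FactorAnalyzer(n_factors=2, method="ml", rotation=None)
fa_ml.fit(X)

print(fa_minres.loadings_)
print(fa_ml.loadings_)
```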
Please remember that even if your variables are normal individually, this does not guarantee that your data are multivariate normal.
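A quick way to see this is a toy construction of my own: take a normal variable and flip its sign at random to build a second variable. Each margin is exactly standard normal, yet the joint distribution is concentrated on two lines, which no bivariate normal distribution is.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
x = rng.normal(size=10_000)
signs = rng.choice([-1.0, 1.0], size=10_000)
y = signs * x                    # y is also standard normal marginally

# Univariate normality tests on each margin (both are exactly N(0,1))
print(stats.normaltest(x).pvalue, stats.normaltest(y).pvalue)

# ...but the pair (x, y) sits on the lines y = x and y = -x,
# so it cannot be bivariate normal.
```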
Let us accept the "weak" assumption of normality. What, then, is the potential threat coming from strongly skewed data like yours? It is outliers. If the distribution of a variable is strongly asymmetric, the longer tail becomes extra influential in computing correlations or covariances, and at the same time it raises the worry of whether that tail still measures the same psychological construct (the factor) as the shorter tail does. It might be prudent to compare whether correlation matrices built on the lower half and the upper half of the rating scale are similar or not. If they are similar enough, you may conclude that both tails measure the same thing, and not transform your variables. Otherwise you should consider transforming, or some other action to neutralize the effect of the "outlier" long tail.
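One way to carry out that check is sketched below; the midpoint split, the pairwise-complete correlations, and the fake Likert-type data are all my assumptions for illustration:

```python
import numpy as np
import pandas as pd

def split_half_correlations(df: pd.DataFrame, midpoint: float):
    """Correlation matrices from responses at or below vs. above the
    scale midpoint (masked cells become NaN; pandas then uses
    pairwise-complete observations)."""
    lower = df.where(df <= midpoint)
    upper = df.where(df > midpoint)
    return lower.corr(), upper.corr()

# Fake 1-7 ratings on four items
rng = np.random.default_rng(2)
df = pd.DataFrame(rng.integers(1, 8, size=(300, 4)),
                  columns=["q1", "q2", "q3", "q4"])

r_low, r_high = split_half_correlations(df, midpoint=4)
print((r_low - r_high).abs().max().max())  # crude index of dissimilarity
```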
There are plenty of transformations. For example, raising to a power > 1 or exponentiation is used for left-skewed data, and a power < 1 or the logarithm for right-skewed data. My own experience says that the so-called optimal transformation via Categorical PCA performed prior to FA is almost always beneficial, for it usually leads to clearer, more interpretable factors in FA; under the assumption that the number of factors is known, it transforms your data nonlinearly so as to maximize the overall variance accounted for by that number of factors.
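As a quick illustration of those power/log rules of thumb (toy distributions of my own choosing, not your data):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)

# Right-skewed data: the log pulls in the long right tail
right = rng.lognormal(size=1_000)
print(stats.skew(right), stats.skew(np.log(right)))    # e.g. ~5 -> ~0

# Left-skewed data on (0, 1): a power > 1 moves the skew toward 0
left = rng.beta(5, 1, size=1_000)
print(stats.skew(left), stats.skew(left ** 2))
```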
There are several things in your description that are a bit confusing. For example, you state that taking the log transform reverses the direction of the coding, but the log by itself does not reverse coding.
Your main question seems to be that when you look at individual pairwise correlations the signs of the correlations are as expected, but some of the signs of the slopes in a multiple regression are the opposite of what you expect. This is not uncommon, since the interpretation of slopes is much more complex in multiple regression models.
Consider this example (I read it recently; I don't get the credit for thinking of it): collect data on the change in various people's pockets, where the variables are the total value of the change (y), the total number of coins (x1), and the total number of coins that are not quarters (x2) - or, if using non-US coins, the number of coins that are not the highest-value coin commonly carried. Generally x1 and x2 will both be positively correlated with y, but if you do a multiple regression using both x1 and x2, the slope on x2 will be negative, because to increase the number of non-quarters without changing the total number of coins we need to trade quarters for other coins of lesser value, which decreases y. You could have something similar happening with your data: does it really make sense to increase the religious variable without the others changing? What is often more meaningful is to compare predicted outcomes for what would be considered common combinations of your predictor variables.
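That coin story is easy to verify by simulation; the sketch below uses made-up counts and coin values of my own:

```python
import numpy as np

rng = np.random.default_rng(4)
n = 1_000

quarters = rng.poisson(3, size=n)                 # quarters per pocket
others = rng.poisson(5, size=n)                   # non-quarter coins
avg_other_value = rng.uniform(1, 24, size=n)      # cents, all below 25

y = 25 * quarters + avg_other_value * others      # total value (cents)
x1 = quarters + others                            # total number of coins
x2 = others                                       # number of non-quarters

# Pairwise correlations: both positive
print(np.corrcoef(x1, y)[0, 1], np.corrcoef(x2, y)[0, 1])

# Multiple regression y ~ x1 + x2 by least squares:
# holding x1 fixed, more non-quarters means fewer quarters, so the
# slope on x2 comes out negative (about the average other-coin value minus 25)
X = np.column_stack([np.ones(n), x1, x2])
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
print(beta)
```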
Regarding 1): Factor analysis is based on correlations/covariances. When a highly skewed variable is part of a correlation, the correlation can be affected by the extreme points. This will affect the factor analysis, although I do not know of literature on the extent of the effect (it has probably been studied, though).
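A tiny demonstration of how a few long-tail points can move a correlation (fabricated data, mine rather than anything from the question):

```python
import numpy as np

rng = np.random.default_rng(5)
n = 200

x = rng.normal(size=n)
y = 0.3 * x + rng.normal(size=n)          # modest true relationship
print(np.corrcoef(x, y)[0, 1])            # roughly 0.3

# Push a handful of points out into a long right tail on both variables
x_sk, y_sk = x.copy(), y.copy()
x_sk[:5] += rng.exponential(10, size=5)
y_sk[:5] += rng.exponential(10, size=5)
print(np.corrcoef(x_sk, y_sk)[0, 1])      # typically inflated well above 0.3
```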
Regarding 2) You do not need to use the same transformation on each variable. But transforming variables in different ways and then doing factor analysis can lead to factors that are somewhat hard to interpret.
Regarding 3) I don't know SPSS, sorry.
More generally, what is the nature of these questions? Are they Likert-type scales? Physical measurements? Or what? Ideally, you could tell us what they actually mean.