SPSS – How to Correctly Treat Multiple Data Points per Subject

I am currently arguing with someoe on how to correctly treat data with multiple measurements for each subject. In this case data was gathered for each subject within a short time for different conditions within each subject. All measurements gather exactely the same variable, just multiple.

One option now is to just group the data by conditions and not care that multiple data points come from one subject. However the data points from each subject are probably not completely independent.

The other alternative is, to first take the mean of all measurements for each condition from each subject and then compare the means. However this will probably impact the significance, since in the final analysis it is not taken into account, that the means have less error.

How can you correctly analyse such data? Is this somehow taken care of in SPSS? In principle it should be possible to calculate the error margin when calculating a mean and than considering this in the final analysis, but I do not guess that SPSS is somehow doing this calculation behind my back.

Best Answer

It would be a violation of independence to "group the data by conditions and not care that multiple data points come from one subject". So that is a no go. One approach is to "to take the mean of all measurements for each condition from each subject and then compare the means". You could do it that way, you wouldn't violate independence, but you are losing some information in the aggregation to subject level means.

On the face of it, this sounds like a mixed design with conditions between subjects and multiple time periods measured within subjects. However, that raises the question, why did you collect data at multiple time points? Is the effect of time, or the progression of a variable over time expected to be different between conditions? If the answer is yes to either of those questions, then given the structure of the data, I would expect that what you are interested in is a mixed ANOVA. The mixed ANOVA will partition the subject variance out of the SSTotal "behind your back" as it were. But whether that partitioning helps out your between subjects test of conditions depends on several other factors.

Anyway, in SPSS/PASW 18 Analyze -> General Linear Model -> Repeated Measures. You'll have one row for each subject and one column for each time point as well as one as their condition identifier. The condition identifier will go into the "between" section and the repeated measures will be taken care of when you define the repeated measure factor.

Best Answer

Related Solutions

Dataset – How to Organize Data for Repeated-Measure Within-Subject Setup?

Solved – How to use ANOVA with two or more data points per condition per participant

Related Question