Solved – Multiple comparisons for correlation matrix

correlationmultiple-comparisonsr

Does the computation of an N x N correlation matrix for N unrelated variables require multiple comparisons correction for all the computed pairwise correlations (assuming each computed correlation is a 'comparison' in the sense of being a 'statistical test')? If so, would FWER be a more appropriate measure than Bonferroni, or do any assumptions about the variables have to be made before using FWER? For instance, does it make a difference in what correction should be used whether the variables are a-priori likely to be correlated (e.g. test scores from the same group of subjects) vs uncorrelated (e.g. each variable represents the test score of a different sample)?

Best Answer

This depends on what question you are trying to answer and what your strategy is.

I like to think about what would happen to my conclusions if I were to add to my data some additional columns of randomly generated noise. In your case this would add more correlations.

If you will declare success/significance if any of the correlations are significant (fishing for significance) then yes, you need to do a correction for multiple comparisons because if the truth is nothing is correlated, but you add a bunch of random noise variables and don't adjust, then you will likely see something significant by chance.

If on the other hand there are specific comparisons that are of interest and would have been of interest if only those 2 variables had been in the study/dataset, then you probably don't want to adjust for multiple comparisons. Think about a case where 2 variables are correlated, but you would prefer them not to be (what I want to eat, but my wife doesn't want me to eat vs. a measure of my health), you could add a bunch of random noise variables and adjust for multiple comparisons and the adjustment would change a significant result into a non-significant result (great for justifying my snack, but not really honest).

Related Question