Correlation – Should Paired Data Correlate When Using Paired Samples T Test?

Tags: correlation, paired-data, pearson-r, t-test

I have repeated measurements from a usability study on 2 different interfaces (n = 23). Since the differences between the paired measurements seem to be normally distributed, I used a paired samples t-test to compare the interfaces; however, there is only a weak correlation between the two sets of measurements (based on the Pearson correlation coefficient).

My question is whether a strong correlation between the two samples is required for paired-sample tests, in particular the paired samples t-test.

Best Answer

My question is whether a strong correlation between the two samples is required for paired-sample tests, in particular the paired samples t-test.

It's not required at all. The test will still work as it should even if the two measurements are only weakly correlated, or indeed literally uncorrelated.

The loss of power from using a paired test when the pairing carries literally no information at all (i.e. when the values are independent) is very small, and even a small amount of positive correlation will tend to make pairing worthwhile.
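To see why the cost is so small: with equal group sizes and independent, equal-variance measurements, the paired and two-sample t statistics have the same noncentrality parameter, so the only thing pairing costs you is degrees of freedom ($n-1$ instead of $2n-2$). Here's a minimal sketch of that calculation (power_nct is just a helper defined for illustration, not part of base R):

# sketch: under independence both tests share the same noncentrality
# parameter; only the degrees of freedom differ (n-1 vs 2n-2)
n = 30; delta = 0.5
ncp = delta * sqrt(n / 2)    # same ncp for paired and two-sample when sd = 1

power_nct = function(df, ncp, alpha = 0.05) {
  crit = qt(1 - alpha / 2, df)
  pt(-crit, df, ncp) + 1 - pt(crit, df, ncp)   # two-sided power
}

power_nct(df = n - 1,     ncp = ncp)   # paired:   df = 29
power_nct(df = 2 * n - 2, ncp = ncp)   # unpaired: df = 58

The two results should differ only slightly, and should sit close to the simulated powers at delta = 0.5 shown further down.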

Indeed the difference is so small that I have seen people recommend that where there's doubt about whether two samples should be considered paired (e.g. the samples might have come from pairs but it's not certain whether they're both in the same order), to simply assume that they are paired, since the cost in power of doing it when it's unpaired is quite small but the gain when it is paired is potentially quite large. [This doesn't require any of that dangerous "data snooping".]
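Conversely, the gain when there is real correlation shows up directly in the standard deviation of the differences: with unit-variance measurements and correlation $\rho$, the sd of the differences is $\sqrt{2(1-\rho)}$. A quick sketch of how the paired test's power grows with $\rho$ (the grid of $\rho$ values is just for illustration):

# power of the paired test as the correlation between measurements grows;
# with unit-variance measurements, sd of differences = sqrt(2 * (1 - rho))
rho = c(0, 0.2, 0.5, 0.8)
sapply(rho, function(r)
  power.t.test(n = 30, delta = 0.5, sd = sqrt(2 * (1 - r)),
               sig.level = 0.05, type = "paired", strict = TRUE)$power)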

[Figure: power of paired and independent t-tests, $n_1 = n_2 = 30$, $\alpha = 0.05$, plotted against $\delta = (\mu_2 - \mu_1)/\sigma$. The paired test has almost identical power to the unpaired test.]

As you see, the impact on power of using a paired test when the paired values are independent, at these fairly typical choices of $n$ and $\alpha$, is extremely small; these calculations (via power.t.test in R) were confirmed by performing simulation at some of the values.

# code for the power-curve comparison
delta = seq(0, 1, len = 21)

# paired test when the measurements are actually independent:
# the sd of the differences is then sqrt(2) for unit-sd measurements
ppair = power.t.test(n = 30, delta = delta, sd = sqrt(2),
                     sig.level = 0.05, type = "paired",
                     alternative = "two.sided", strict = TRUE)

# unpaired (two-sample) test with unit per-group sd
punpair = power.t.test(n = 30, delta = delta, sd = 1,
                       sig.level = 0.05, type = "two.sample",
                       alternative = "two.sided", strict = TRUE)

with(punpair, plot(power ~ delta, type = "l", col = 2, ylim = c(0, 1)))
with(ppair, points(power ~ delta, type = "l", col = 3))

Here's example simulation code:

# example simulated values as a check, at delta=0.5:
res = replicate(10000,{
         x=rnorm(30);y=rnorm(30,.5);  
         c(pu=t.test(x,y,var.equal=TRUE)$p.value,
           pp=t.test(x,y,paired=TRUE)$p.value)
       })
apply((res < .05), 1, mean)   # proportion of rejections = simulated power
    pu     pp 
0.4699 0.4612 

# re-running the replicate() call above gives another set of simulated
# powers at the same delta:
apply((res < .05), 1, mean)
    pu     pp 
0.4738 0.4634 

# example simulation at delta=0.8:
res = replicate(10000,{
         x=rnorm(30);y=rnorm(30,.8);  
         c(pu=t.test(x,y,var.equal=TRUE)$p.value,
           pp=t.test(x,y,paired=TRUE)$p.value)
       })
apply((res<.05),1,mean)
    pu     pp 
0.8625 0.8503 
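
For comparison, the corresponding theoretical powers at delta = 0.8 can be read straight off power.t.test; they should land close to the simulated 0.85-0.86 above:

# theoretical powers at delta = 0.8, for comparison with the simulation
power.t.test(n = 30, delta = 0.8, sd = 1, sig.level = 0.05,
             type = "two.sample", strict = TRUE)$power    # unpaired
power.t.test(n = 30, delta = 0.8, sd = sqrt(2), sig.level = 0.05,
             type = "paired", strict = TRUE)$power        # paired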

The first simulated example sits roughly where the gap between the power curves is largest, near delta = 0.5 (the ratio of powers is biggest near delta = 0.4; the raw difference is biggest just above delta = 0.6).
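
If you want to check that from the power.t.test output above, something along these lines will do (it assumes ppair, punpair and delta from the power-curve code are still in the workspace):

# locate where the raw power difference and the power ratio peak on the grid
gap   = punpair$power - ppair$power
ratio = punpair$power / ppair$power
delta[which.max(gap)]     # raw difference: just above delta = 0.6
delta[which.max(ratio)]   # ratio: near delta = 0.4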

If you plot these simulated values, they fall quite close to the corresponding curves; on re-simulation the points wiggle about a little, but the difference typically remains similar to the gap between the true curves. I didn't show these points on the plot here, as I felt they would distract from the broader point of showing the curves.
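
For instance, overlaying the simulated powers quoted above on the existing plot only takes a couple of extra calls (colours chosen to match the curves):

# overlay the simulated powers on the power curves plotted earlier
points(c(0.5, 0.8), c(0.4699, 0.8625), col = 2, pch = 16)   # unpaired, simulated
points(c(0.5, 0.8), c(0.4612, 0.8503), col = 3, pch = 16)   # paired, simulated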