Solved – How to get a $p$-value from the Cochran-Armitage trend test

Tags: association-measure, chi-squared-test, genetics, p-value

I'm working with GWAS SNP data and want to perform several tests for association between genotype and phenotype. There are two phenotypes (case and control) and two or three genotypes. Most of the tests are chi-squared tests on different contingency tables, $2 \times 2$ or $2 \times 3$; one of them is the Cochran-Armitage trend test (CATT).

Once I have constructed the contingency table, I can easily get a $p$-value using the Apache commons math library for the Chi-squared tests. No problem.

However, the explanation of the CATT on Wikipedia is not sufficient for me to implement it (my statistics knowledge is limited and I'm still learning).

As in the Wikipedia example, I suspect a linear trend, so my weights are $t = (0,1,2)$, which turns the formula for $T$ into:
$$
T \equiv (N_{12}R_2 - N_{22}R_1) + 2(N_{13}R_2 - N_{23}R_1)
$$
and the one for the variance
$$
Var(T) = {{R_1 R_2} \over N} ( N(C_2+4C_3) - (C_2 + 2C_3)^2)
$$

I checked how the program PLINK does it, since it's already implemented there, but it differs slightly from the above formulas. The C++ source code there would correspond to this:
$$
T = {(N_{12}R_2 - N_{22}R_1) + 2(N_{13}R_2 - N_{23}R_1)\over N}
$$
and
$$
Var(T) = {{R_1 R_2} \over N} {( N(C_2+4C_3) - (C_2 + 2C_3)^2) \over N^2}
$$

Then it calculates a chi-squared value like this:
$$
\chi^2_{T} = {T^2 \over Var(T)}
$$
and calculates the $p$-value as for any other chi-squared value with $df = 1$.
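The procedure described above can be sketched in a few lines of Python using only the standard library. This is an illustrative sketch, not PLINK's code: the function name `catt` and the row order (controls first, cases second) are assumptions made here, the counts are those of the snp10001 table quoted further down this page, and the variance is written for general weights, which for $t = (0,1,2)$ reduces to $N(C_2+4C_3) - (C_2+2C_3)^2$ times $R_1 R_2 / N$.

```python
import math

def catt(table, weights=(0, 1, 2)):
    """Cochran-Armitage trend test for a 2 x k table (hypothetical helper).

    table: two rows of counts, one per phenotype group.
    Returns (T, Var(T), chi-squared, p-value with 1 df).
    """
    row1, row2 = table
    r1, r2 = sum(row1), sum(row2)               # row totals R_1, R_2
    n = r1 + r2                                  # grand total N
    cols = [a + b for a, b in zip(row1, row2)]   # column totals C_i

    # T = sum_i t_i * (N_1i * R_2 - N_2i * R_1)
    t_stat = sum(t * (a * r2 - b * r1) for t, a, b in zip(weights, row1, row2))

    # Var(T) = R_1 R_2 / N * (N * sum_i t_i^2 C_i - (sum_i t_i C_i)^2)
    s1 = sum(t * c for t, c in zip(weights, cols))
    s2 = sum(t * t * c for t, c in zip(weights, cols))
    var_t = r1 * r2 / n * (n * s2 - s1 ** 2)

    chi2 = t_stat ** 2 / var_t
    # Upper tail of a chi-squared variable with 1 df: P(X > x) = erfc(sqrt(x / 2))
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return t_stat, var_t, chi2, p_value

# Counts from the snp10001 example: controls (24, 21, 2), cases (68, 32, 10)
table = [(24, 21, 2), (68, 32, 10)]
t_stat, var_t, chi2, p_value = catt(table)
print(t_stat, chi2, p_value)
```

For these counts $T = 306$, and the chi-squared value and $p$-value should come out close to 0.286 and 0.59. The `erfc` identity only holds for one degree of freedom; for other degrees of freedom a proper chi-squared distribution function (such as the one in Apache Commons Math) is needed.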

I don't need to understand the theory completely, as long as my program calculates correctly, but understanding it would give me additional confidence.

Is this correct or legitimate? Is this how I'll get the $p$-value?

Best Answer

This is just a different definition of the statistic $T$. Call your statistic $T_1$ and the other $T_2$. Note that $T_2 = T_1/N$, which is why the variance of $T_2$ differs from that of $T_1$ by a factor of $1/N^2$. However, the chi-squared statistic is the same in either case. For $T_2$ there is a factor of $1/N^2$ in both the numerator and the denominator; it cancels and so does not appear in the formula using $T_1$. You use the same test statistic either way.
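Written out, with $T_2 = T_1/N$ and $Var(T_2) = Var(T_1)/N^2$, the cancellation is:
$$
\chi^2_{T_2} = {T_2^2 \over Var(T_2)} = {(T_1/N)^2 \over Var(T_1)/N^2} = {T_1^2 \over Var(T_1)} = \chi^2_{T_1}
$$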

Related Solutions

Solved – the difference between independence.test in R and Cochrane and Armitage trend test

As a follow-up to my comment, if independence.test refers to coin::independence_test, then you can reproduce a Cochran-Armitage trend test, as it is used in GWAS analysis, as follows:

> library(SNPassoc)
> library(coin)
> data(SNPs)
> datSNP <- setupSNP(SNPs,6:40,sep="")
> ( tab <- xtabs(~ casco + snp10001, data=datSNP) )
     snp10001
casco T/T C/T C/C
    0  24  21   2
    1  68  32  10
> independence_test(casco~snp10001, data=datSNP, teststat="quad",
                    scores=list(snp10001=c(0,1,2)))

Asymptotic General Independence Test

data:  casco by snp10001 (T/T < C/T < C/C) 
chi-squared = 0.2846, df = 1, p-value = 0.5937

This is a conditional version of the CATT. Regarding the scoring of the ordinal variable (here, the frequency of the minor allele, denoted by the letter C), you can play with the scores= argument of independence_test() in order to reflect the model you want to test (the above result is for a log-additive model).

There are five different genetic models that are generally considered in GWAS, and they reflect how genotypes might be collapsed: codominant (T/T (92) C/T (53) C/C (12), yielding the usual $\chi^2(2)$ association test), dominant (T/T (92) vs. C/T-C/C (65)), recessive (T/T-C/T (145) vs. C/C (12)), overdominant (T/T-C/C (104) vs. C/T (53)) and log-additive (0 (92) < 1 (53) < 2 (12)). Note that genotype recoding is readily available in inheritance functions from the SNPassoc package. The "scores" should reflect these collapsing schemes.
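As a rough illustration of the collapsing schemes listed above, the following Python sketch merges the genotype columns of the snp10001 table for the dominant, recessive and overdominant models. The helper `collapse` and the `MODELS` mapping are hypothetical names introduced here; they are not part of SNPassoc or any other package.

```python
def collapse(table, groups):
    """Merge columns of a 2 x 3 table.

    groups: one tuple of column indices per merged column.
    """
    return [tuple(sum(row[i] for i in g) for g in groups) for row in table]

# Column order is T/T, C/T, C/C (indices 0, 1, 2)
MODELS = {
    "dominant": [(0,), (1, 2)],       # T/T vs. C/T-C/C
    "recessive": [(0, 1), (2,)],      # T/T-C/T vs. C/C
    "overdominant": [(0, 2), (1,)],   # T/T-C/C vs. C/T
}

# Rows: controls (casco = 0), cases (casco = 1)
table = [(24, 21, 2), (68, 32, 10)]

collapsed = {name: collapse(table, groups) for name, groups in MODELS.items()}
# Column totals of each collapsed 2 x 2 table, to compare with the counts in the text
column_totals = {
    name: tuple(a + b for a, b in zip(*t)) for name, t in collapsed.items()
}
for name, totals in column_totals.items():
    print(name, collapsed[name], totals)
```

Each collapsed table is an ordinary $2 \times 2$ table, so it can go through the usual 1-df chi-squared test; the codominant and log-additive models keep all three columns.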

Following Agresti (CDA, 2002, p. 182), CATT is computed as $n\cdot r^2$, where $r$ stands for the linear correlation between the numerical scores and the binary outcome (case/control), that is

z.catt <- sum(tab)*cor(datSNP$casco, as.numeric(datSNP$snp10001))^2
1 - pchisq(z.catt, df = 1)  # p=0.5925
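The same $n\cdot r^2$ computation can be sketched without R by working directly from the grouped counts. The Python function below is a hypothetical helper, not taken from any package; it computes the Pearson correlation between the scores and the 0/1 outcome from the table. Multiplying $r^2$ by $n - 1$ instead of $n$ appears to reproduce the chi-squared value printed by coin above, which would be consistent with that test being the conditional version, though this is inferred from the numbers rather than from the coin documentation.

```python
import math

def trend_chi2_from_correlation(table, scores=(0, 1, 2)):
    """Return (n * r^2, r, n) for a 2 x k table with rows (controls, cases)."""
    controls, cases = table
    n = sum(controls) + sum(cases)
    # Sums over individuals, computed from grouped counts:
    # x is the genotype score, y is 1 for a case and 0 for a control.
    sx = sum(s * (a + b) for s, a, b in zip(scores, controls, cases))
    sxx = sum(s * s * (a + b) for s, a, b in zip(scores, controls, cases))
    sy = sum(cases)                       # sum of y equals sum of y^2
    sxy = sum(s * b for s, b in zip(scores, cases))
    # Centered sums of squares and cross-products
    cxy = sxy - sx * sy / n
    cxx = sxx - sx ** 2 / n
    cyy = sy - sy ** 2 / n
    r = cxy / math.sqrt(cxx * cyy)
    return n * r ** 2, r, n

table = [(24, 21, 2), (68, 32, 10)]
chi2_catt, r, n = trend_chi2_from_correlation(table)
chi2_conditional = (n - 1) * r ** 2       # compare with the coin output
p_catt = math.erfc(math.sqrt(chi2_catt / 2))   # 1-df upper tail
print(chi2_catt, chi2_conditional, p_catt)
```

Because correlation is unchanged by shifting the scores, using $(0,1,2)$ here or the factor codes $(1,2,3)$ produced by as.numeric() in the R snippet gives the same result.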

There also exist various built-in CATT functions in the R/Bioconductor ecosystem for GWAS, e.g.

CATT() from Rassoc, e.g.

with(datSNP, CATT(table(casco, snp10001), 0.5)) # p=0.5925

(additive/multiplicative)

In snpMatrix, they are headed as the 1-df $\chi^2$-test when you call single.snp.tests() (see the vignette); please note that the default mode of inheritance is the codominant/additive effect.

Finally, here are two references that discuss the choice of scoring scheme depending on the genetic model under consideration, and some issues with power/robustness:

Zheng, G, Freidlin, B, Li, Z and Gastwirth, JL (2003). Choice of scores in trend tests for case-control studies of candidate-gene associations. Biometrical Journal, 45: 335-348.
Freidlin, B, Zheng, G, Li, Z, and Gastwirth, JL (2002). Trend Tests for Case-Control Studies of Genetic Markers: Power, Sample Size and Robustness. Human Heredity, 53: 146-152.

See also the GeneticsDesign (bioc) package for power calculation with linear trend tests.

Solved – Do the properties of Pearson’s chi-squared test for independence hold true for continuous PDFs

In general, because continuous data usually has a dispersion attribute, the Pearson chi-squared test doesn't make any sense. This is because the expected value in the denominator is attributed to the mean-variance relationship of categorical data. If you're looking for a generalized version of the Pearson chi-squared test statistic, you must specify which attribute of the chi-squared statistic is interesting to you. Because it is the score test for a logistic regression model, you could consider using a score test for any other regression model, like linear regression, or use a different test statistic like those from the Wald or likelihood ratio tests. This addresses the inferential aspect of the test: whether the row and column variables are independent.

Alternatively, you may be interested in this test statistic as a measure of goodness of fit or calibration for a predictive model. If you can sensibly bin the data into distinct groups, you can still use the chi-squared test statistic, or a kappa statistic measuring agreement between observed and predicted data, which is especially useful with more than two distinct groups in the outcome. I would prefer staying on the continuous scale and using the MSE as a measure of predictive accuracy, preferably the cross-validated MSE of your predictive model. This can also be standardized into a measure like the $R^2$ value, which has associated significance tests as well.
