Solved – Calculate odds ratio confidence intervals from plink output

confidence intervalgeneticsodds-ratio

I have output from plink haplotype analysis, however I do not have the raw data. Here, is the output for Haplotype-based association tests with GLMs:

SNP1    SNP2    HAPLOTYPE   F   OR  STAT    P
rs1 rs2 22  0.00992 4.23    61.5    4.43E-15
rs1 rs2 12  0.038   1.02    0.217   0.642
rs1 rs2 21  0.00015 5.22E-10    453 1.77E-100
rs1 rs2 11  0.952   0.762   22.9    1.73E-06

Here is the explanation for each column from plink:

        SNP1    SNP ID of left-most (5') SNP
        SNP2    SNP ID of left-most (3') SNP
   HAPLOTYPE    Haplotype 
           F    Frequency in sample
          OR    Estimated odds ratio
        STAT    Test statistic (T from Wald test)
           P    Asymptotic p-value

Question: based on above output, is it possible to calculate OR 95% confidence intervals?

Best Answer

For the calculation of confidence intervals you'll need standard errors for the effects, but those are not available in the output. However, the standard errors can be estimated from the Wald statistics and odds ratios.

The calculation goes as follows:

Take a natural logarithm from the odds ratio. This gives you the beta from the logistic model. For example for the first row of your table: beta=ln(4.23)=1.442
The standard error for the beta is calculated by dividing the beta by the square root of the Walds statistic (STAT). Then take the absolute value of the result. Again, for the first row of your table: se=1.442/sqrt(61.5)=0.183.
The 95% confidence interval for the beta is then beta+/-1.96*se. The constant 1.96 comes from the normal distribution. Again, for the first row of data: 1.442-1.96*0.183 ... 1.442+1.96*0.183 = 1.081...1.802.
Last, you need to change the confidence interval of the beta to the confidence interval of the odds ratio. This happens simply by exponentiating the confidence interval of the beta. For the first line of data: 2.71828^1.081 = 2.949 and 2.71828^1.802 = 6.065.

So, your odds ratio for the first row of the table is 4.23 and it's 95% confidence interval is 2.949-6.065. Because the confidence interval does not include one, the results is statistically significant. The results are subject to error due to rounding of the output from PLINK.

This calculation can be achieved in, e.g., Excel, but below is also an R function that does the same thing (just in case you also use R).

# The data
or<-structure(list(SNP1 = structure(c(1L, 1L, 1L, 1L), .Label = "rs1", class = "factor"), 
SNP2 = structure(c(1L, 1L, 1L, 1L), .Label = "rs2", class = "factor"), 
HAPLOTYPE = c(22L, 12L, 21L, 11L), F = c(0.00992, 0.038, 
0.00015, 0.952), OR = c(4.23, 1.02, 5.22e-10, 0.762), STAT = c(61.5, 
0.217, 453, 22.9), P = c(4.43e-15, 0.642, 1.77e-100, 1.73e-06
)), .Names = c("SNP1", "SNP2", "HAPLOTYPE", "F", "OR", "STAT", 
"P"), class = "data.frame", row.names = c(NA, -4L))

# The function
orci<-function(or) {
   or$beta<-log(or$OR)
   or$se<-abs(or$beta/sqrt(or$STAT))
   or$lower<-or$beta-1.96*or$se
   or$upper<-or$beta+1.96*or$se
   or$LOWER<-exp(or$lower)
   or$UPPER<-exp(or$upper)
   or$res<-paste(or$OR, " (", round(or$LOWER, 3), "-", round(or$UPPER, 3), ")", sep="")
   return(or)
}

# The calculation
orci(or)

# The result
#SNP1 SNP2 HAPLOTYPE       F       OR    STAT         P         beta         se        lower       upper        LOWER        UPPER                  res
#1  rs1  rs2        22 0.00992 4.23e+00  61.500  4.43e-15   1.44220199 0.18390288   1.08175235   1.8026516 2.949844e+00 6.065710e+00    4.23 (2.95-6.066)
#2  rs1  rs2        12 0.03800 1.02e+00   0.217  6.42e-01   0.01980263 0.04251018  -0.06351733   0.1031226 9.384579e-01 1.108627e+00   1.02 (0.938-1.109)
#3  rs1  rs2        21 0.00015 5.22e-10 453.000 1.77e-100 -21.37335353 1.00420775 -23.34160072 -19.4051063 7.292419e-11 3.736538e-09 0.000000000522 (0-0)
#4  rs1  rs2        11 0.95200 7.62e-01  22.900  1.73e-06  -0.27180872 0.05679965  -0.38313603  -0.1604814 6.817202e-01 8.517337e-01  0.762 (0.682-0.852)

Related Solutions

Solved – How to calculate confidence intervals for pooled odd ratios in meta-analysis

In most meta-analysis of odds ratios, the standard errors $se_i$ are based on the log odds ratios $log(OR_i)$. So, do you happen to know how your $se_i$ have been estimated (and what metric they reflect? $OR$ or $log(OR)$)? Given that the $se_i$ are based on $log(OR_i)$, then the pooled standard error (under a fixed effect model) can be easily computed. First, let's compute the weights for each effect size: $w_i = \frac{1}{se_i^2}$. Second, the pooled standard error is $se_{FEM} = \sqrt{\frac{1}{\sum w}}$. Furthermore, let $log(OR_{FEM})$ be the common effect (fixed effect model). Then, the ("pooled") 95% confidence interval is $log(OR_{FEM}) \pm 1.96 \cdot se_{FEM}$.

Update

Since BIBB kindly provided the data, I am able to run the 'full' meta-analysis in R.

library(meta)
or <- c(0.75, 0.85)
se <- c(0.0937, 0.1029)
logor <- log(or)
(or.fem <- metagen(logor, se, sm = "OR"))

> (or.fem <- metagen(logor, se, sm = "OR"))
    OR            95%-CI %W(fixed) %W(random)
1 0.75  [0.6242; 0.9012]     54.67      54.67
2 0.85  [0.6948; 1.0399]     45.33      45.33

Number of trials combined: 2 

                         OR           95%-CI       z  p.value
Fixed effect model   0.7938  [0.693; 0.9092] -3.3335   0.0009
Random effects model 0.7938  [0.693; 0.9092] -3.3335   0.0009

Quantifying heterogeneity:
tau^2 < 0.0001; H = 1; I^2 = 0%

Test of heterogeneity:
    Q d.f.  p.value
 0.81    1   0.3685

Method: Inverse variance method

References

See, e.g., Lipsey/Wilson (2001: 114)

Confidence Interval – Calculating Odds Ratio and Confidence Interval in Meta-Analysis

I did the following in Stata, the first is fixed effect and the second is random effect. I got different answers than you did.

           Study     |     ES    [95% Conf. Interval]     % Weight
---------------------+---------------------------------------------------
1                    |  2.700       1.800     4.000         63.47
2                    |  1.300       0.500     3.400         36.53
---------------------+---------------------------------------------------
I-V pooled ES        |  2.189       1.312     3.065        100.00
---------------------+---------------------------------------------------
 Heterogeneity calculated by formula
  Q = SIGMA_i{ (1/variance_i)*(effect_i - effect_pooled)^2 } 
where variance_i = ((upper limit - lower limit)/(2*z))^2 



 Heterogeneity chi-squared =   2.27 (d.f. = 1) p = 0.132
  I-squared (variation in ES attributable to heterogeneity) =  56.0%

  Test of ES=0 : z=   4.89 p = 0.000

. metan or ll ul, effect(Odds Ratio) null(1) lcols(trialname) texts(200) random

           Study     |     ES    [95% Conf. Interval]     % Weight
---------------------+---------------------------------------------------
1                    |  2.700       1.800     4.000         55.93
2                    |  1.300       0.500     3.400         44.07
---------------------+---------------------------------------------------
D+L pooled ES        |  2.083       0.721     3.445        100.00
---------------------+---------------------------------------------------
 Heterogeneity calculated by formula
  Q = SIGMA_i{ (1/variance_i)*(effect_i - effect_pooled)^2 } 
where variance_i = ((upper limit - lower limit)/(2*z))^2 

  Heterogeneity chi-squared =   2.27 (d.f. = 1) p = 0.132
  I-squared (variation in ES attributable to heterogeneity) =  56.0%
  Estimate of between-study variance Tau-squared =  0.5488

  Test of ES=0 : z=   3.00 p = 0.003

Best Answer

Related Solutions

Solved – How to calculate confidence intervals for pooled odd ratios in meta-analysis

Confidence Interval – Calculating Odds Ratio and Confidence Interval in Meta-Analysis

Related Question