Solved – the likelihood ratio for this chi-square test (assocstats)

chi-squared-testlikelihood-ratior

I am using the assocstats function in the vcd library. It gives something called the likelihood ratio and its associated p-value which is different from the chi-squared statistic and its p-value. What is this likelihood ratio and what is it's distribution?

e.g.,

> library(vcd)
> data("Arthritis")
> tab <- xtabs(~Improved + Treatment, data = Arthritis)
> assocstats(tab)
                    X^2 df  P(> X^2)
Likelihood Ratio 13.530  2 0.0011536
Pearson          13.055  2 0.0014626

Phi-Coefficient   : NA 
Contingency Coeff.: 0.367 
Cramer's V        : 0.394 
> chisq.test(tab)

    Pearson's Chi-squared test

data:  tab
X-squared = 13.055, df = 2, p-value = 0.001463

Best Answer

A test using that likelihood ratio is generally called G-test. With r×s contingency table, $G = 2 \sum_{i=1}^r \sum_{j=1}^s O_{ij} * ln (O_{ij}/E_{ij}) $. $O_{ij}$ is the observed count and $E_{ij}$ is the expected count under the null hypothesis. Distribution of G is approximately a chi-squared distribution. G-test (wikipedia) gives more information.

observe <- mar_table(tab)[1:3, 1:2]  # delete total
expect <- independence_table(tab)

# (observe) Treatment          (expect) Treatment
# Improved Placebo Treated    Improved   Placebo   Treated
#   None        29      13       None   21.500000 20.500000
#   Some         7       7       Some    7.166667  6.833333
#   Marked       7      21       Marked 14.333333 13.666667

G <- 2 * sum( observe * log(observe/expect) )  # [1] 13.52981
chisq <- sum( (observe - expect)^2/expect )      # [1] 13.05502 # as a reference

1 - pchisq(G, df = 2)             # [1] 0.001153559
1 - pchisq(chisq, df = 2)           # [1] 0.001462643

Related Solutions

Computing Pearson’s Chi-Squared Test Statistic for Lack of Fit in Logistic Regression with R

The sum of the squared Pearson residuals is exactly equal to the Pearson $\chi^2$ test statistic for lack of fit. So if your fitted model (i.e., the glm object) is called logistic.fit, the following code would return the test statistic:

sum(residuals(logistic.fit, type = "pearson")^2)

See the documentation on residuals.glm for more information, including what other residuals are available. For example, the code

sum(residuals(logistic.fit, type = "deviance")^2)

will get you the $G^2$ test statistic, just the same as deviance(logistic.fit) provides.

Chi-Squared Test – How to Apply Chi-Squared Test to Multidimensional Data

To analyze a multi-way contingency table, you use log-linear models. In truth, log-linear models are a special case of the Poisson generalized linear model, so you could do that, but log-linear models are more user-friendly. In Python, you may need to use the Poisson GLM, as I gather log-linear models may not be implemented. I will demonstrate the log-linear model using your data with R.

library(MASS)
tab = array(c(95, 31, 20, 70, 29, 18, 21, 69, 98, 54, 35, 11), dim=c(3,2,2))
tab = as.table(tab)
names(dimnames(tab)) = c("outcomes", "actions", "observations")
dimnames(tab)[[1]] = c("0", "1", "2")
dimnames(tab)[[2]] = c("0", "1")
dimnames(tab)[[3]] = c("1", "2") 
tab
# , , observations = 1
#         actions
# outcomes  0  1
#        0 95 70
#        1 31 29
#        2 20 18
# 
# , , observations = 2
#         actions
# outcomes  0  1
#        0 21 54
#        1 69 35
#        2 98 11

Log-linear models are simply a series of goodness of fit tests. We can start with a (trivial) null model that assumes all cells have the same expected value:

summary(tab)
# Number of cases in table: 551 
# Number of factors: 3 
# Test for independence of all factors:
#  Chisq = 159.18, df = 7, p-value = 4.772e-31

The null is rejected. Next, we can fit a saturated model:

m.sat = loglm(~observations*actions*outcomes, tab)
m.sat
# Call:
# loglm(formula = ~observations * actions * outcomes, data = tab)
# 
# Statistics:
#                  X^2 df P(> X^2)
# Likelihood Ratio   0  0        1
# Pearson            0  0        1

Naturally, this fits perfectly. At this point, we could build up from the null model seeing if additional terms improve the fit, or drop terms from the saturated model to see if the fit gets significantly worse. The latter is more convenient and is conventional. To see if the distribution of outcomes by actions differs as a function of the observation, we need to drop the interactions between the observations and the actions * outcomes. If we also drop the marginal effect of observations, we are testing if the mean count differs between the two levels of observations. That may or may not be of interest to you, I don't know.

m1 = loglm(~observations + actions*outcomes, tab)
sum(tab[,,1])  # 263
sum(tab[,,2])  # 288
m2 = loglm(~actions*outcomes, tab)
anova(m2, m1)
# LR tests for hierarchical log-linear models
# 
# Model 1:
#   ~actions * outcomes 
# Model 2:
#   ~observations + actions * outcomes 
# 
#           Deviance df Delta(Dev) Delta(df) P(> Delta(Dev))
# Model 1   126.4172  6                                    
# Model 2   125.2825  5   1.134691         1         0.28678
# Saturated   0.0000  0 125.282534         5         0.00000

Model 1 has dropped a single degree of freedom from Model 2 (note that, confusingly, Model 1 $\leftrightarrow$ m2, and Model 2 $\leftrightarrow$ m1), but the decrease in model fit is very small. It is not significant. There is not enough evidence to suggest that the mean counts differ by observation. On the other hand, when Model 2 is compared to the Saturated model, the decrease in fit is highly significant. The data are inconsistent with the idea that the distribution of counts is the same in both levels of observation.

Best Answer

Related Solutions

Computing Pearson’s Chi-Squared Test Statistic for Lack of Fit in Logistic Regression with R

Chi-Squared Test – How to Apply Chi-Squared Test to Multidimensional Data

Related Question