Logistic – Intercept of Logistic Regression with Contrast Coding for Better Model Interpretation

contrastsinterceptlogistic

Say I have a binary dependent variable (Choice) being either 0 or 1, and people answer this DV multiple times. I also evenly split people in two groups (Group, group A vs group B).

I simulate data so that I know that there's an overall 50% probability to choose 1 in group A, but only an overall 5% probability to choose 1 in group B. The average sample probability to choose 1 (both groups combined) is around 27.5%.

I then run a generic generalized linear mixed model with a binomial distribution : Choice ~ Group + (1 | participant).

If I rely on dummy coding, the estimates for the intercept makes sense to me. That is, if I put A = 0 and B = 1, thus choosing A as reference group, the intercept can be transformed into the overall probability to choose a 1 in group A. This checks out, as I obtain ~50%. Same goes when I code A = 1 and B = 0, thus choosing B as reference group : I get a 5% probability to choose a 1 in group B.

However, when I rely on contrast coding, I'm getting lost. If I code A = -1/2 and B = 1/2, I'm expecting to observe the average probability (as 0 is the value between the two groups), thus around 27.5%. But when I transform the intercept of this contrast-coded model into probability, I'm obtaining 18%. Why is that ?

Best Answer

With contrast coding, the intercept is the average of the group effects on the logit scale, not the average of the group probabilities.

# Use `logit` and `inv_logit` for clarity
logit <- function(x) qlogis(x)
inv_logit <- function(x) plogis(x)

# (Expected) Intercept
(logit(0.5) + logit(0.05)) / 2
#> [1] -1.472219

# Probability for the "average" group
inv_logit(-1.472219)
#> [1] 0.1866056

You mention that the sampling is balanced. This information is irrelevant for your question: the expected value of the intercept changes with the coding/parametrization but not with the sampling proportions. The number of participants in each group determines how efficiently we can estimate group parameters: fewer participants, larger standard errors.

Related Solutions

Solved – Interpret logistic regression output with multiple categorical & continious variables

Your understanding seems generally correct. The intercept in this and in other standard R regression summaries represents the case for the reference levels of all categorical variables (false for logical) and for a 0 value of all continuous variables.

So for your question 2 the reference is occ.group1 and area0, as it is for all comparisons given the way you have labeled the levels of the variables. occ.group1 and area0 as in your question 3 is the reference group with odds calculated from the intercept for a 0 wage, but you need to specify an hourly wage to get the odds for a non-zero wage.

Your interpretation in question 4 seems to be a bit off. If the areas and wages are the same for the two groups then the only difference to consider in the specified comparison is the coefficient for occ.group2 versus the occ.group1 reference.

Solved – What statistical test does -contrast- use after regression in Stata

Here are examples showing this in Stata for OLS, but the intuition carries over to the logit index function coefficients.

Both of these are doing Wald tests. Let's do the second one first. You are testing that the linear index function inside the logit is the same for all groups each relative to the base value, so that is the same test that all the coefficients are zero (excluding the constant). The joint tests that all these differences are the same:

cls
net from http://www.stata-press.com/data/ivrm/
net get ivrm
use "pain.dta", clear
reg pain i.dosegrp
test 2.dosegrp
test 3.dosegrp
test 4.dosegrp
test 5.dosegrp
test 6.dosegrp
test 2.dosegrp 3.dosegrp 4.dosegrp 5.dosegrp 6.dosegrp  

contrast r.dosegrp, noeffects

The polynomial contrast is more involved. Trend analysis partitions the sum of squares for the model into portions due to linear trend, quadratic trend, cubic trend, and so on. If there are $K$ groups, it is possible to look at up to $K - 1$ trends. Here's an example with $K=6$ on the same data as above.

Trend analysis is performed using coefficients of orthogonal polynomials. For 6 levels, they look like:

             g1  g2  g3  g4 g5   g6
Linear      -5  -3  -1   1   3   5
Quadratic    5  -1  -4  -4  -1   5
Cubic       -5   7   4  -4  -7   5
Quartic      1  -3   2   2  -3   1
Quintic     -1   5 -10  10  -5   1

One way to perform trend analysis is to use the coefficients of orthogonal polynomials to weight the group sums to compute sums of squares for each of the trends. An alternative is to apply the coefficients of orthogonal polynomials directly to the observations and to analyze using regression. This creates a set of variables such that the "effects" of all the preceding variable have been removed from each variable:

net from http://www.stata-press.com/data/ivrm/
net get ivrm
use "pain.dta", clear

/* (1) Automated Polynomial Contrasts */
regress pain i.dosegrp
contrast p.dosegrp, noeffects

/* (2) Slightly More Manual With User-Defined Contrasts on the levels of dosegrp */
contrast {dosegrp -5  -3  -1   1   3   5}  ///
         {dosegrp  5  -1  -4  -4  -1   5}  ///
         {dosegrp -5   7   4  -4  -7   5}  ///
         {dosegrp  1  -3   2   2  -3   1}  ///
         {dosegrp -1   5 -10  10  -5   1}  ///
         {dosegrp -1   5 -10  10  -5   1}, noeffects

/* (3) Regression method where Stata creates the varibles for you */
orthpoly dosegrp, deg(5) generate(op*)
regress pain op*
test op1 // linear
test op2 // quadratic
test op3 // cubic
test op4 // quartic
test op5 // quintic
test op1 op2 op3 op4 op5 // joint

/* (4) Very Manual Way Using Regression Where You Hard Code Everything */
forvalues i=1/5 {
    gen o`i' = .
}

replace o1 = -5 if dosegrp == 1
replace o2 =  5 if dosegrp == 1
replace o3 = -5 if dosegrp == 1
replace o4 =  1 if dosegrp == 1
replace o5 = -1 if dosegrp == 1

replace o1 = -3 if dosegrp == 2
replace o2 = -1 if dosegrp == 2
replace o3 =  7 if dosegrp == 2
replace o4 = -3 if dosegrp == 2
replace o5 =  5 if dosegrp == 2

replace o1 =  -1 if dosegrp == 3
replace o2 =  -4 if dosegrp == 3
replace o3 =   4 if dosegrp == 3
replace o4 =   2 if dosegrp == 3
replace o5 = -10 if dosegrp == 3

replace o1 =  1 if dosegrp == 4
replace o2 = -4 if dosegrp == 4
replace o3 = -4 if dosegrp == 4
replace o4 =  2 if dosegrp == 4
replace o5 = 10 if dosegrp == 4

replace o1 =  3 if dosegrp == 5
replace o2 = -1 if dosegrp == 5
replace o3 = -7 if dosegrp == 5
replace o4 = -3 if dosegrp == 5
replace o5 = -5 if dosegrp == 5

replace o1 = 5 if dosegrp == 6
replace o2 = 5 if dosegrp == 6
replace o3 = 5 if dosegrp == 6
replace o4 = 1 if dosegrp == 6
replace o5 = 1 if dosegrp == 6

tw connected o1 dosegrp, sort
tw connected o2 dosegrp, sort
tw connected o3 dosegrp, sort
tw connected o4 dosegrp, sort
tw connected o5 dosegrp, sort

reg pain o1 o2 o3 o4 o5
test o1 // linear
test o2 // quadratic
test o3 // cubic
test o4 // quartic
test o5 // quintic
test o1 o2 o3 o4 o5 // joint

Best Answer

Related Solutions

Solved – Interpret logistic regression output with multiple categorical & continious variables

Solved – What statistical test does -contrast- use after regression in Stata

Related Question