I am trying to validate a mixed effects logit regression model with a categorical dependent variable and categorical predictor variables – I have nothing that is continuous. One of my predictor variables is binary, and the other has three possible values (not ordinal). My formula is something like this:
glmer(Response ~ Handedness + Color + (1 | Subject) + (1 | Item) + (1 | NumResponses), data = r1, family = "binomial")
Here Response is something like "yes/no", Handedness is "left/right", and Color is "green/blue/red". I have roughly 2000 responses from 500 subjects, though some subjects gave more responses than others. The design is also unbalanced: there are more righties than lefties, and more green/red items than blue ones (this is not what the actual data are about, but the point is that we weren't able to sample evenly from the equivalent of Handedness and Color). Although I could use other predictors and/or random effects, I've already compared candidate models (using AIC/BIC values) by dropping variables, and this is the best one. Results look something like this:
     AIC      BIC   logLik deviance
    1733     1755   -828.9     1702
Random effects:
 Groups       Name        Variance Std.Dev.
 Subject      (Intercept) 2.1680   1.47243
 NumResponses (Intercept) 0.0000   0.00000
 Item         (Intercept) 0.4183   0.64676
Number of obs: 1708, groups: Subject, 560; NumResponses, 48; Item, 48
Fixed effects:
               Estimate Std. Error z value Pr(>|z|)
(Intercept)      1.2097     0.2310   5.238 1.63e-07 ***
ColorGreen      -0.2254     0.3271  -0.689    0.491
ColorRed        -1.2007     0.2285  -5.254 1.49e-07 ***
HandednessLeft   1.2248     0.2189   5.595 2.20e-08 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
I'm very new to these sorts of models (I'm sure this is glaringly obvious), but if I'm interpreting it correctly and it's not a garbage model, lefties have increased odds of providing a yes relative to righties, and red decreases the odds of providing a yes relative to blue. So the model tells me there is something in my data, but how do I tell if it's even a good model? As far as I understand it, the AIC and BIC tell me it's a better model than some other ones I've tried, but it could still be a horrible fit. I can't figure out how to do much with plotting diagnostics because all of the variables are categorical (although the proportion of yes/no responses for these groups clearly agrees with the results of the model). From here (http://www.ats.ucla.edu/stat/r/dae/melogit.htm), it seems like I should do some sort of bootstrap, but for all categorical variables this seems overly complex to me (or perhaps I'm just recoiling at the idea of implementing it). Is this the best way or is there another approach?
Best Answer
This is a pretty broad question. One fairly simple check: compute the predicted values on the probability (type="response") scale, dichotomize by rounding up or down to 0 or 1, and cross-tabulate the 2x2 table ("predicted yes", "predicted no" $\times$ "observed yes", "observed no"). You can calculate overall accuracy, or sensitivity and specificity if you like, and figure out how those measures vary across categories.
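A minimal sketch of that cross-tabulation, on simulated data. For brevity it fits a plain glm (no random effects), so it runs without lme4, but predict(fit, type = "response") works the same way on a glmer fit; the variable names and effect sizes are made up for illustration:

```r
# Simulated yes/no data with two categorical predictors (illustrative only)
set.seed(1)
n <- 500
d <- data.frame(
  Handedness = factor(sample(c("Left", "Right"), n, replace = TRUE, prob = c(0.3, 0.7))),
  Color      = factor(sample(c("Blue", "Green", "Red"), n, replace = TRUE))
)
# Assumed "true" pattern: lefties and non-red items are more likely to say yes
eta <- 0.5 + 1.2 * (d$Handedness == "Left") - 1.5 * (d$Color == "Red")
d$Response <- rbinom(n, 1, plogis(eta))

fit <- glm(Response ~ Handedness + Color, data = d, family = binomial)
# For the mixed model this would instead be something like:
# fit <- glmer(Response ~ Handedness + Color + (1 | Subject) + (1 | Item),
#              data = r1, family = binomial)

p    <- predict(fit, type = "response")  # fitted probabilities
pred <- as.integer(p >= 0.5)             # dichotomize at 0.5

# 2x2 cross-tabulation of predicted vs observed
tab <- table(predicted = factor(pred, levels = 0:1),
             observed  = factor(d$Response, levels = 0:1))
print(tab)

accuracy    <- sum(diag(tab)) / sum(tab)
sensitivity <- tab["1", "1"] / sum(tab[, "1"])  # P(predict yes | observed yes)
specificity <- tab["0", "0"] / sum(tab[, "0"])  # P(predict no  | observed no)
cat(sprintf("accuracy=%.2f sensitivity=%.2f specificity=%.2f\n",
            accuracy, sensitivity, specificity))

# How accuracy varies across one of the categorical predictors:
tapply(pred == d$Response, d$Handedness, mean)
```

Comparing these per-category accuracies against the overall accuracy is a quick way to spot groups the model fits poorly.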