Solved – When to include random slopes in linear mixed models

lme4-nlmemixed model

I'm running a few LMMs and I am finding that I don't quite understand when to include random slopes. I'm worried that my models will be wildly inaccurate and inflate the type I error rate.

First example: I'm testing whether condition (2 levels) affects reaction time and I'm using a mixed model because 1) subjects completed numerous trials on which reaction time was measured, so I want to include random intercepts for subjects, and 2) the reaction time data are from a task that includes two trial types that are not of relevance to the hypothesis, so I want to control for that by including trial type as a fixed effect.

Condition was manipulated within subjects. This is where I start to get confused. I plan to model condition as a fixed effect, like so:

lmer(RT ~ trialtype + condition + (1|subject), data=df, REML=F)

but is it also imperative that I include random slopes for subjects by condition like so?

lmer(RT ~ trialtype + condition + (1+condition|subject), data=df, REML=F)

(Note, to test my model I will compare this to a reduced model that drops the fixed effect of condition. Same is true for the following examples.)

Second example: I'm testing whether a measure of X predicts RT within one condition, which is different measure of X. I set this up like so:

lmer(cond2_RT ~ trialtype + Xmeasure + (1|subject), data=df, REML=F)

But I'm not sure if I should be including random slopes for subjects, like so:

lmer(cond2_RT ~ trialtype + Xmeasure + (1+Xmeasure|subject), data=df, REML=F)

I've read that it's important to include random slopes when you are testing the effect of some variable in order to control the type I error rate, but in this case I'm confused because Xmeasure is a predictor, not a manipulated variable and I don't know that I expect the relation between Xmeasure and cond2_RT to vary by subject. Am I thinking about this incorrectly?

Third and final example: I also want to test whether condition interacts with Xmeasure to predict RT. Here is how I've set this up:

lmer(cond2_RT ~ trialtype + Xmeasure*condition + (1+condition|subject), data=df, REML=F)

But again, I'm not sure if I should also be modeling random subject slopes for Xmeasure and also the interaction between condition and Xmeasure. I'm worried about including too much in the model, and also about not being able to justify the inclusion of these slopes. But I don't want to leave anything out that is crucial.

Thanks in advance for any suggestions!

Best Answer

If you have real doubt about whether or not random slopes would improve your model for whatever purpose you're putting it to, you don't need to decide in advance. Instead, use a generic model-selection technique, such as estimating predictive accuracy with cross-validation and selecting the most predictively accurate model. Just make sure that whatever design you want to consider is identifiable.

Related Solutions

Solved – Two factor linear mixed effect model with multiple slopes (lmer)

Strictly speaking the model you present is syntactically correct.

Usually convergence failures are due to model misspecification, or insufficient data for estimation.

We do not know how many data-points you have so that might be an issue. Here you are specifying a model that has correlations between the random slopes for TIME and TREATMENT. You might want to try a model that has no correlations between the slopes but has two intercepts, eg. lmer(RATIO ~ TREATMENT + TIME + (TREATMENT| SUBJECT) + (TIME | SUBJECT), data=my.data) and see if this is adequate for your purposes. (The 1 + are redundant so I omitted them)

Solved – Random slopes for interactions without random slopes for main effects

I'm not sure this makes sense.

First of all, it won't work the way you expect: A:B will expand to a model of the same dimensionality as A*B (and lme4 isn't quite as smart as it might be about handling the redundancies

set.seed(101)
dd <- expand.grid(A=factor(1:2),B=factor(1:2),Subject=1:40,rep=1:20)
dd$Y <- simulate(~A*B+(A*B|Subject),
                 newdata=dd, family=binomial,
                 weights=rep(1,nrow(dd)),
                 newparams=list(beta=rep(1,4),
                                theta=rep(1,10)))[[1]]
library(lme4)

Your original model:

m1 <- glmer(Y ~ A*B + (A*B|Subject), dd, family=binomial)

This gives convergence warnings, and ends up with a log-likelihood 0.003 less than the original (basically, numeric fuzz)

m2 <- update(m1, . ~ A*B+(A+A:B|Subject))

This is also equivalent:

m3 <- update(m1, . ~ A*B+(A:B-1|Subject))
all.equal(logLik(m1),logLik(m3),tolerance=1e-4) ## TRUE

If you really wanted to leave out the main effect of B you'd have to use dummy variables.

For the meaning of this: can you explain what the model would mean? In particular, the model that you intend (not what lme4 actually does) when you use (A+A:B|Subject) would have among-individual variation in the effects of A, no effects of B at the baseline value of A (assuming you are using the standard treatment contrasts), but variation in the effect of adding B to A. Another analogy would be to a model with fixed intercepts but variable slopes.

Best Answer

Related Solutions

Solved – Two factor linear mixed effect model with multiple slopes (lmer)

Solved – Random slopes for interactions without random slopes for main effects

Related Question