Solved – Different variance-covariance matrices of random effects per fixed-effect group in lme4

covariance-matrixlme4-nlmemixed modelr

Consider the following lme4 model: Y ~ X*Condition + (1+X|Trial).

Here Trial is nested in a binary Condition. This model calculates the variance of random intercept, variance of random slopes and the correlation of random intercepts and random slopes. I would like to calculate all these values but separately for Condition A and Condition B. How can I do that?

Additional experimental details

The experiment we conducted is a two-choice task where people have to choose the correct answer. There are two conditions in the task (condition A and B), one of which makes the task harder. All participants go through sets of trials for both condition A and condition B. All trials are coded in a single variable named Trial – A trials are named A1, A2, A3, and B trials B1, B2, B3… We want to predict Y (binomial variable) from X (continuous variable) while controlling for the random effects in different trials.

Best Answer

Thanks to @amoeba and using @BenBolker's brief remark here

Extensions such as allowing different residual variances or different variance-covariance matrices of random effects per (fixed-effect) group can be achieved, somewhat clunkily, by using the dummy() helper function to construct an indicator variable to multiply by individual levels of interest.

we got to the bottom of the problem.

The solution is following:

Y ~ X*Condition + 
    (X*Condition | Subject) + 
    (0 + dummy(Condition, "A") + X:dummy(Condition, "A") | Trial) + 
    (0 + dummy(Condition, "B") + X:dummy(Condition, "B") | Trial)

The summary(model) in random effects yields:

Groups  Name                        Variance Std.Dev. Corr             
 subject (Intercept)                 0.89343  0.9452                    
         X                           0.11695  0.3420   -0.85            
         ConditionB                  0.66731  0.8169   -0.33  0.06      
         X:ConditionB                0.07391  0.2719    0.34 -0.05 -0.47
 Trial   dummy(Condition, "A")       0.63854  0.7991                    
         dummy(Condition, "A"):X     0.09372  0.3061   -0.76            
 Trial.1 dummy(Condition, "B")       0.88833  0.9425                    
         dummy(Condition, "B"):X     0.12175  0.3489   -0.60

which now makes perfect sense, because only correlations that can be calculated are between intercepts and slopes for a certain condition (because they are estimated for the same trials). Correlations between Conditions are senseless.

Furthermore, coef(model)$Trial now shows logical values:

      dummy(Condition, "A")   dummy(Condition, "A"):X    dummy(Condition, "B")   dummy(Condition, "B"):X      (Intercept)          X           ConditionB    X:ConditionB
A1                0.9198822                 0.0209849                        0                         0       2.703544   -0.9929765          -0.07102448        0.2415836
A2               -1.3029020                 0.3894812                        0                         0       2.703544   -0.9929765          -0.07102448        0.2415836
A3                1.1294702                -0.2475288                        0                         0       2.703544   -0.9929765          -0.07102448        0.2415836
B1              0.000000000              0.0000000000               1.21725268              -0.305314643       2.703544   -0.9929765          -0.07102448        0.2415836
B2              0.000000000              0.0000000000               0.88317976              -0.209529267       2.703544   -0.9929765          -0.07102448        0.2415836
B3              0.000000000              0.0000000000               0.27859781              -0.065708851       2.703544   -0.9929765          -0.07102448        0.2415836

Fixed effects are the same for all trials
dummy(Condition, "A") intercepts and dummy(Condition, "A"):X slopes are calculated only for Condition A, in Condition B trials they are estimated to be 0, and vice versa.

N.B. When specifying random effects for this purpose, it is important to:

specify random effects for different groups of trials independently, not under the same |Trial. If you don't do that, lme4 will estimate random effects for all trials, not just for the given condition.
include 0, so as to prevent lme4 from including a general random intercept across all trials.

Related Solutions

Solved – Nested random effects model in lme4

As you've described the study, trial is nested within block, but block isn't nested within subject. That is, trial 3 is a different question in blocks 1 and 2, but block 3 is the same set of 8 questions for each subject. Hence, a natural way to structure the random effects would be to have one random intercept effect per subject plus 8N random intercepts nested into N batches of 8, where N is the number of blocks. Or, if N is small, you could treat block as a fixed effect and have a single batch of 8N per-trial random intercepts (plus the aforementioned per-subject intercepts).

You asked what the difference is between fancy random-effects structures like these and Cartesian-producting all the dummy variables in a study together to get one big batch of random effects (new variable). The difference is that each batch of random effects has its variance estimated separately, and that orthogonal effects are obliged to behave consistently. (And, of course, the more random effects you have, the harder it is to estimate each.) To use a simpler example, imagine you have a model where each subject is a child and you have dummy variables for the child's father and mother. Assume the dataset has a lot of half-siblings in it, so that mother and father effects are distinguishable. If you say

lmer(outcome ~ 1 + fixed effects + (1|Mother) + (1|Father))

then the model is allowed to believe, e.g., that the effects of father vary more than the effects of mothers. On the other hand, if you make each mother–father pair its own value of a single dummy variable, and say

lmer(outcome ~ 1 + fixed effects + (1|new variable))

then new variable gets only one variance. Also, whereas this model allows for arbitrarily complicated interactions between mother and father, the first model postulates that the effects are purely additive. And if $M$ is the number of mothers and $F$ the number of fathers, the first model has $M + F$ different random effects and the second has $MF$.

Finally, I don't think it's wise to consider RT and Correct in completely separate models. Shouldn't whether people answer a question correctly be related to how quickly they answer it?

Best Answer

Related Solutions

Solved – Nested random effects model in lme4

Related Question