R – Reporting Results from Linear Mixed Models: Changing Significance of Interaction Terms

lme4-nlme, r, reporting

I've used linear mixed models to test whether the factors genotype and sex influence colon length, while including batch as a random effect. I first ran the test value ~ genotype + SEX + (1 | BOX) and got the following results, with sex being significant:

Linear mixed model fit by REML. t-tests use Satterthwaite's method ['lmerModLmerTest']
Formula: value ~ genotype + SEX + (1 | BOX)
   Data: ColonLength.new

REML criterion at convergence: 94.1

Scaled residuals: 
     Min       1Q   Median       3Q      Max 
-1.69433 -0.55283  0.00537  0.61300  2.01439 

Random effects:
 Groups   Name        Variance Std.Dev.
 BOX      (Intercept) 0.1346   0.3669  
 Residual             0.2819   0.5309  
Number of obs: 49, groups:  BOX, 20

Fixed effects:
              Estimate Std. Error      df t value Pr(>|t|)    
(Intercept)     8.0512     0.1832 16.1116  43.943  < 2e-16 ***
genotypemGlu5   0.0914     0.2281 16.3917   0.401  0.69379    
SEXF           -0.7549     0.2280 16.5801  -3.310  0.00425 ** 
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Correlation of Fixed Effects:
            (Intr) gntyG5
genotypmGl5 -0.515       
SEXF        -0.518 -0.142

After including a sex-genotype interaction with the formula value ~ genotype*SEX + (1 | BOX), sex is no longer significant:

Linear mixed model fit by REML. t-tests use Satterthwaite's method ['lmerModLmerTest']
Formula: value ~ genotype * SEX + (1 | BOX)
   Data: ColonLength.new

REML criterion at convergence: 92.7

Scaled residuals: 
     Min       1Q   Median       3Q      Max 
-1.62511 -0.66926  0.05283  0.58236  2.10327 

Random effects:
 Groups   Name        Variance Std.Dev.
 BOX      (Intercept) 0.1362   0.3691  
 Residual             0.2800   0.5291  
Number of obs: 49, groups:  BOX, 20

Fixed effects:
                   Estimate Std. Error      df t value Pr(>|t|)    
(Intercept)          7.9527     0.2055 15.2546  38.707   <2e-16 ***
genotypemGlu5        0.3299     0.3196 14.4285   1.032    0.319    
SEXF                -0.5174     0.3185 18.0472  -1.625    0.122    
genotypemGlu5:SEXF  -0.4888     0.4570 15.6767  -1.070    0.301    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Correlation of Fixed Effects:
            (Intr) gntyG5 SEXF  
genotypmGl5 -0.643              
SEXF        -0.645  0.415       
gntyG5:SEXF  0.450 -0.699 -0.697

Should I report both? I.e., something like "the main effect of sex was significant (β = -.75, SE = .23, p = .004), but was no longer significant when the interaction between sex and genotype was included (β = -.52, SE = .32, p = .12)"? Is this an appropriate way to report the results? (I know there are also people who recommend reporting the fixed effect estimates, the confidence interval, and the strength of the effect, and still others who report LMMs like "F(df, dferror) = F-value, p = p-value". Which is preferable?)

If there is a main effect, is it appropriate to then go and look at the interaction between terms? Or is that something I should primarily be doing if there's not a main effect observed?

Apologies if these questions are inane – I don't have much experience with statistics and have been kind of thrown in the deep end. I'd really appreciate any help.

Best Answer

In your second model, the effect of sex shown (-0.5174) is the estimate of the effect of sex at the reference level of genotype. The estimate for the sex effect at the mGlu5 level is the sum of that coefficient and the interaction coefficient: -0.5174 - 0.4888 = -1.006. So when the interaction is in the model there is no longer a single main effect of sex reported, just different estimates for different genotypes, suggesting a greater effect of sex for the mGlu5 genotype. Yet the p-value for the interaction is 0.301, so there isn't much evidence from the data that those effects are genuinely different in the population.
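As a quick check of that arithmetic, here is a minimal sketch in base R that rebuilds the sex effect within each genotype, and the four implied cell means, from the fixed-effect estimates printed in the interaction model output. The labels "ref" and "M" for the reference levels are assumptions (the output only shows the non-reference levels mGlu5 and F), and the variable names are purely illustrative.

```r
# Fixed-effect estimates copied from the interaction model output
b0     <-  7.9527  # (Intercept): mean for reference genotype, reference sex
b_geno <-  0.3299  # genotypemGlu5: genotype effect in the reference sex
b_sex  <- -0.5174  # SEXF: sex effect at the reference genotype
b_int  <- -0.4888  # genotypemGlu5:SEXF: change in the sex effect for mGlu5

# Effect of sex (F vs. reference sex) within each genotype
sex_effect_ref   <- b_sex
sex_effect_mglu5 <- b_sex + b_int

# Model-implied cell means ("ref" and "M" are assumed labels)
cell_means <- c(
  ref_M   = b0,
  ref_F   = b0 + b_sex,
  mGlu5_M = b0 + b_geno,
  mGlu5_F = b0 + b_geno + b_sex + b_int
)

print(c(ref = sex_effect_ref, mGlu5 = sex_effect_mglu5))
print(cell_means)
```

The difference between the two sex effects is exactly the interaction coefficient, which is why the interaction's p-value is the test of whether the sex effect differs between genotypes.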

Now, it probably makes more sense to think about the effect of genotype varying by sex than the effect of sex varying by genotype (although mathematically they are the same thing). Still, there is little evidence from your data that an interaction is present, so I would probably report the main effects (first model) as your best estimates of the effects of sex and genotype, while mentioning that the second model shows little evidence for an interaction (although it doesn't rule one out; interactions are difficult to detect).
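If you want to report a confidence interval alongside the estimate, one rough option is a Wald-type t interval built from the printed estimate, standard error and Satterthwaite degrees of freedom. The sketch below does this in base R for the SEXF coefficient of the first model. Treat it as an approximation under the stated assumption (a t reference distribution with the Satterthwaite df); profile or bootstrap intervals from the fitted model object may differ somewhat.

```r
# SEXF row of the additive model's fixed-effects table
est <- -0.7549   # Estimate
se  <-  0.2280   # Std. Error
df  <- 16.5801   # Satterthwaite degrees of freedom

# Two-sided 95% Wald-type t interval (assumes a t reference distribution)
t_crit <- qt(0.975, df)
ci <- est + c(-1, 1) * t_crit * se

print(round(c(estimate = est, lower = ci[1], upper = ci[2]), 2))
```

The interval excludes zero, in line with the p-value of 0.00425 reported for SEXF.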

Related Solutions

R Mixed Model – Interaction Term in a Linear Mixed Effect Model in R

Here's what I would do:

First, I would have a look here on how to specify the random term in your model1. I am not quite sure what you are trying to fit. There is also a lot of info on linear mixed effects models here on CV. Click on the lme4-nlme tag, which you also provided. It would also help if you could provide an example dataset, or at least the structure of your data.

Then, you most likely only need one model, which is presumably in the form of:

my_model <- lmer(carbon ~ species + landuse + species : landuse + (1|site), data = mydata)

I specified the random effect to be + (1|site), because you said:

Study sites are included as the random effect in the model.

To get the ANOVA table you can either do:

library(car)
Anova(my_model)

or:

library(afex)
mixed(carbon ~ species + landuse + species : landuse + (1|site), data = mydata)

or instead of running lmer() through the lme4 package, load the lmerTest package and run:

library(lmerTest)  # masks lme4::lmer so that anova() reports F tests with p-values
my_model <- lmer(carbon ~ species + landuse + species : landuse + (1|site), data = mydata)
anova(my_model)

This will give you the ANOVA table you probably need eventually. Make sure to have a look at those functions and their arguments (?Anova, ?mixed, ?lmerTest::anova.lmerModLmerTest).

I don't quite understand why you would want to exclude species if the interaction is significant and run separate models for all species.

If your main effects are not significant, you could consider dropping them and re-running the model with the interaction only. If one or both main effects are significant, though, I would keep them both in the model and report them together with any significant interaction.

In any case, if you have a significant interaction you should focus on interpreting the interaction and not the main effects since their interpretation could now be misleading. The interpretation of the interaction should start by visualizing it. You could do this for example using the emmip() function in the emmeans package:

library(emmeans)
emmip(my_model, landuse ~ species)

Regarding the adjustment of p-values, you only need to do that if you are following up with post-hoc tests.

This could be done with the emmeans() function (also from the emmeans package):

emmeans(my_model, pairwise ~ species : landuse)

Mixed Model Reporting – How to Report Results with and without Interaction Term

First of all, yes, you are correct in the way you are interpreting the fixed effects.

However, note that we are only dealing here with fixed effects. Your model also has random effects, and in particular you have random coefficients for treatment which means that each subject has their own individual treatment effect. The calculations with the fixed effects therefore represent averages across all subjects.

Although the findings are largely the same, I would present your findings based on the model with the interactions. We can view the model with no interactions with this plot:

While the model with interactions looks like this:

Formally, you can do a likelihood ratio test using the anova() function to test which model is better. The models have to be fitted with maximum likelihood (the REML=FALSE option) for this, because REML likelihoods are not comparable between models with different fixed effects.
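A minimal sketch of such a comparison, assuming the lme4 package is installed. The data here are simulated and entirely hypothetical (the level names, effect sizes and design are made up for illustration), and the variable names simply mirror the colon-length question rather than the treatment-by-time models discussed in this answer.

```r
library(lme4)

set.seed(1)
n_box <- 20

# Hypothetical simulated data: 4 animals per box, both sexes and genotypes
d <- data.frame(
  BOX      = factor(rep(seq_len(n_box), each = 4)),
  genotype = factor(rep(c("WT", "mGlu5"), times = 2 * n_box),
                    levels = c("WT", "mGlu5")),
  SEX      = factor(rep(c("M", "M", "F", "F"), times = n_box),
                    levels = c("M", "F"))
)
box_effect <- rnorm(n_box, sd = 0.35)
d$value <- 8 - 0.75 * (d$SEX == "F") +
  box_effect[as.integer(d$BOX)] + rnorm(nrow(d), sd = 0.5)

# Fit both models by maximum likelihood so their likelihoods are comparable
m_add <- lmer(value ~ genotype + SEX + (1 | BOX), data = d, REML = FALSE)
m_int <- lmer(value ~ genotype * SEX + (1 | BOX), data = d, REML = FALSE)

# Likelihood ratio test of the interaction term
lrt <- anova(m_add, m_int)
print(lrt)
```

A large p-value in the Pr(>Chisq) column would indicate that adding the interaction does not improve the fit much, which is the same question the t-test on the interaction coefficient addresses.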

I would focus on treatment 3 being associated with higher values of amp.sqrt at time 7 and higher still at time 8; these differences are also statistically significant at the 5% level. Treatment 3 is associated with lower values at time 6 (although this difference is not statistically significant at the 5% level). Also, there is very little difference between treatments 1 and 2 at any time point (and these differences are also not statistically significant). Moreover, there appears to be no time trend for treatments 1 and 2. You might be interested in a test of whether treatment 2 is different between years 6 and 8, since there is a small upward trend. Personally this looks negligible to me compared with treatment 3, but if you wanted to test this you could use a post-hoc test such as Dunnett's.
