Solved – mixed effects and lme4: Do I need nesting

ecologyexperiment-designlme4-nlmenested datarandom-effects-model

I am analyzing data from a field experiment, and I am interested in the effects of fauna and altitude (fixed). Altitude has two levels, and at each site I have 5 blocks for the three levels of fauna (the blocks were set up to account for the heterogeneity in the soil).
I guess that the model treating block as random, in lme4, would be:

lmer(response ~ altitude  + fauna + (1|block))

However, I replicated this study in 5 mountains. So can I treat mountain as another random factor and then:

lmer(response ~ altitude  + fauna + (1|block) + (1|mountain))

Or should I nest my blocks within mountains?

lmer(response ~ altitude  + fauna + (1|mountain:block))

I see people using nested designs when they have pseudoreplicates, but that is not my really my case as all my replicates are independent.

Here's how my data looks like:

mountain   altitude   treatment block response  
m1         high       t1        b1    124.77  
m1         high       t2        b1    55.77  
m1         high       t3        b1    88.99  
m1         high       t1        b2    88.99  
m1         high       t2        b2    88.99  
m1         high       t3        b2    88.99  
m1         low        t1        b6    124.77  
m1         low        t2        b6    55.77  
m1         low        t3        b6    78.99  
m1         low        t1        b7    89.99  
m1         low        t2        b7    33.99  
m1         low        t3        b7    22.87  
m2
.
.
.

Best Answer

Since the blocks are explicitly nested, the random parts are (1|block) and (1|mountain).

altitude is a fixed term. If you want a single estimate for each level of altitude (except, of course, for the reference level), then just include altitude a single term as you did in your second suggestion above. However, you can let the effect of altitude vary between mountains, ie. a random slope model.

lmer(response ~ fauna + (1|block) + (altitude|mountain))

(You do not mention treatment, but it is part of your dataset so perhaps it should be in the model too?)

I think that a loglikelihood ratio test can be used to determine if the random slope model performs significantly better than the simpler model.

Eg.

fm1 <- lmer(response ~ altitude + fauna + (1|block) + (1|mountain))
fm2 <- lmer(response ~ fauna + (1|block) + (altitude|mountain))
anova(fm1, fm2)

Related Solutions

Solved – How to analyze this incomplete block design in R

I think you're exactly right.

Set up data like your example:

d <- expand.grid(Site=factor(1:10),rep=1:5)
d <- transform(d,Clone=factor(LETTERS[(as.numeric(Site)+1) %/% 2]))
library(lme4)
## could use development version of lme4 to simulate, but will do
## it by hand
beta <- c(2,1,3,-2,2)  ## clone effects (intercept + differences)
X <- model.matrix(~Clone,d)
set.seed(1)
u.site <- rnorm(length(levels(d$Site)),sd=1)
    d$y <- rnorm(nrow(d),
       mean=X %*% beta + u.site[d$Site],
       sd=2)

Now analyze:

m1 <- lmer(y~Clone+(1|Site),data=d)
round(fixef(m1),3)
## (Intercept)      CloneB      CloneC      CloneD      CloneE 
##       2.624      -0.034       2.504      -2.297       2.396

VarCorr(m1)
##  Groups   Name        Std.Dev.
##  Site     (Intercept) 0.0000  
##  Residual             1.6108

I don't think there's actually anything wrong, but I used a pretty big residual variance, and so in this case (probably only on a subset of replicates), lmer estimates a zero among-site variation.

Mixed Effects Model – How to Handle Nested Data with Mixed Effects Model in R

I think this is correct.

(1|Tree/Organ/Sample) expands to/is equivalent to (1|Tree)+(1|Tree:Organ)+(1|Tree:Organ:Sample) (where : denotes an interaction).
The fixed factors Treatment, Organ and Tissue automatically get handled at the correct level.
You should probably include Site as a fixed effect (conceptually it's a random effect, but it's not practical to try to estimate among-site variance with only two sites); this will reduce the among-tree variance slightly.
You should probably include all the data within a data frame, and pass this explicitly to lmer via a data=my.data.frame argument.

You may find the glmm FAQ helpful (it's focused on GLMMs but does have stuff relevant to linear mixed models as well).

Best Answer

Related Solutions

Solved – How to analyze this incomplete block design in R

Mixed Effects Model – How to Handle Nested Data with Mixed Effects Model in R

Related Question