Solved – Mixed model repeated measures in R – specific questions

covariancemixed modelrrepeated measuressas

I am using a mixed model repeated measures analysis and have a few questions in terms of how to structure the model.

My setup is 24 individual plots, grouped into 6 blocks. There are 4 treatments in 2 levels: i.e. 2 different treatment types, 4 treatments in total: No treatment, treatment 1, treatment 2, interaction between treatment 1 and 2. Each treatment is replicated once per block, i.e. 6 times in total.

Measurements are repeated over time. I.e. the same plot is repeatedly measured at different time points. All plots are measured once for each time point. The time intervals between each measurement campaign (time point) are not equal (i.e. could vary from a few days to several weeks), which needs to be accounted for in the analysis, if possible. Most likely, the variances are also not equal for each measurement campaign.

I would put Block as a random factor, but it could also be added as a fixed effect.

So to sum up:

Dependent variable – y

Fixed effects – Treatments T1 and T2, separately and interaction

Random effect – Block (could also be fixed?)

Subject – PlotID, individual plots (which are repeated through time)

Repeated measures – Time point (given as day-of-year)

I tried the following syntax for mixed model repeated measures in R (lme4 package, lmerTest for p-values):

a. lmer(y ~ T1 * T2 * Time + (1|Block), data=dataset)

But I understand there are other ways as well:

b. lmer(y ~ T1 * T2 * Time + (1+Time|Block), data=dataset)

c. lmer(y ~ T1 * T2 * Time + (1|Block/Time), data=dataset)

d. lmer(y ~ T1 * T2 * Time + (1|Block:Time), data=dataset)

e. lmer(y ~ T1 * T2 * Time + (1+Block|PlotID), data=dataset)

Here are my questions:

Which model would be most correct in my case? What are the differences between them?
I should note that when I ran models b-e, my model did not always converge. What would be the reason for this?
Is this the correct way of doing repeated measures – to simply include Time as a fixed factor? Or is there another way of structuring this type of analysis?
Should Time (day-of-year) be classified as factor, numeric or integer?
Should I specify PlotID anywhere (as in e)? I need the model to recognise that I have 24 individual plots repeated over time. At the same time, I want to correct for the noise arising from the differences between blocks. Each combination of treatment and block is unique (i.e. 24 in total).
What covariance structure does R use?
How are the degrees of freedom calculated?
I previously set up the analysis in SAS Enterprise. Here, I ran into the problem that I got no significant results at all, even though it looks like there should at least be some significance according to my graphs. Also, I get completely different results in R and SAS. What is the reason for this?

Best Answer

I try to answer, or at least give hints, to those questions I feel a little bit familiar with. I'm no statistics expert, but this is what I understood so far about longitudinal data analysis and growth models using lme4.

I found this page quite helpful to decide how to specify the formula depending on the design. Your random part would probably look like (1 + time | Block/PlotID) - however, I'm not sure if you even would want to include the interaction in the random parts as well.

Maybe these link are also helpful:

Question 2) For repeated measure (or growth models), time should be included as fixed and random factor.

Question 3) If you use time as factor, you may get an error (like number of observations <= number of random effects or so). In such cases, use argument control = lmerControl(check.nobs.vs.nRE="ignore") in your lmer-call. See this post for more details. For more than two or three time points, I would include time as numeric.

Question 4) What do you exactly mean by that? You have to transform your data into long format, thus having a subject-ID-variable which repeats its values for each time point (i.e. each subject is represented max. once per time points).

Related Solutions

Solved – Repeated measures, mixed model, ANOVA or…

Don't forget the SUBJECT random effect. That's the one that makes it into a repeated measures/mixed model design. You are correct that TYPE and TREATMENT are fixed, but how to treat TIME will depend on what assumptions you want to make about time. The simplest thing to do would just be to leave it out and treat the five measures as independent subsamples on each individual, but more generally, you could treat TIME as a fixed effect/repeated measure and model the correlation between each time point.

The preferred terminology in these models can differ depending on your field; as the same model can often be described several ways, so it's not really a matter of deciding whether it's a repeated measure/mixed model/ANOVA; you could probably use any of those terms to describe the model you end up with. What's more important is to define what terms you want to include in the model and how you want them to be able to vary.

Solved – Using lmer for repeated-measures linear mixed-effect model

I think that your approach is correct. Model m1 specifies a separate intercept for each subject. Model m2 adds a separate slope for each subject. Your slope is across days as subjects only participate in one treatment group. If you write model m2 as follows it's more obvious that you model a separate intercept and slope for each subject

m2 <- lmer(Obs ~ Treatment * Day + (1+Day|Subject), mydata)

This is equivalent to:

m2 <- lmer(Obs ~ Treatment + Day + Treatment:Day + (1+Day|Subject), mydata)

I.e. the main effects of treatment, day and the interaction between the two.

I think that you don't need to worry about nesting as long as you don't repeat subject ID's within treatment groups. Which model is correct, really depends on your research question. Is there reason to believe that subjects' slopes vary in addition to the treatment effect? You could run both models and compare them with anova(m1,m2) to see if the data supports either one.

I'm not sure what you want to express with model m3? The nesting syntax uses a /, e.g. (1|group/subgroup).

I don't think that you need to worry about autocorrelation with such a small number of time points.

Best Answer

Related Solutions

Solved – Repeated measures, mixed model, ANOVA or…

Solved – Using lmer for repeated-measures linear mixed-effect model

Related Question