You shouldn't compare AICs between models fitted with different software. gam() is fitted via some fancy code fu in the mgcv package, whereas your gamm() fit is actually produced by the MASS (glmmPQL()) and nlme (lme()) packages. It is common for different constants to end up in the log-likelihood.
When I read your question, I assumed you wanted to compare a GAM with no random effects to the same model with one or more random effects. To do that, fit the non-random-effect model with gamm() too. For example, using @ACD's example data (from a now-deleted answer):
library(mgcv)

set.seed(13)
x <- rnorm(1000)
eff <- as.factor(round(rnorm(1000) + 5))
y <- exp(x) * runif(1000) + as.numeric(eff)
plot(x, y)
gam_example <- gamm(y ~ s(x), method = "REML", family = Gamma(link = "log"))
gamm_example <- gamm(y ~ s(x), method = "REML", random = list(eff = ~ 1),
                     family = Gamma(link = "log"))
Comparing the two fits gives:
> AIC(gam_example$lme)
[1] -2.136317
> AIC(gamm_example$lme)
[1] -1286.448
Hence there is strong support for the inclusion of the random effect, and we can compare the AICs because both models were fitted via the same algorithm and code.
Technically, what @ACD showed (in a now-deleted answer) is also incorrect. Whilst the two models are comparable in the sense that both include a random effect for the eff variable, their AICs are not comparable because very different algorithms are used in the fitting:
## after running the code from @ACD's answer
> AIC(gam_example)
[1] 1721.197
> AIC(gamm_example$lme)
[1] -1286.448
The difference in AIC is meaningless: the gamm() model is fitted using a penalised quasi-likelihood (PQL) criterion, whereas the gam() model is fitted using a standard (here, REML) criterion.
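If you'd rather avoid gamm() altogether, one option (my sketch, not part of the original answer) is to express the random intercept as a random-effect smooth, s(eff, bs = "re"), so that both models are estimated by gam()'s REML machinery and their AICs share the same likelihood constants:

```r
## Sketch: keep both fits inside gam() so the AICs are directly comparable.
## Assumes x, y, and eff from the simulated example above are in scope.
library(mgcv)

m0 <- gam(y ~ s(x), method = "REML", family = Gamma(link = "log"))
m1 <- gam(y ~ s(x) + s(eff, bs = "re"), method = "REML",
          family = Gamma(link = "log"))

AIC(m0, m1)  # same algorithm, same constants, so the difference is meaningful
```

The bs = "re" basis exploits the duality between smooths and random effects, so m1 is the gam() analogue of the gamm() fit with random = list(eff = ~ 1).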
Best Answer
They are the same thing; we just prefer the terminology "hierarchical" over "mixed", because the salient practical feature of these models is that they can model variation in the response that occurs at multiple levels, rather than the fact that they have fixed and random effects.
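As a concrete illustration (my sketch, not from the answer), a random-intercept model captures exactly this multi-level variation: the response varies both within and between groups, and the fit reports a variance component for each level:

```r
## Minimal hierarchical (a.k.a. mixed) model using nlme's built-in
## Orthodont data: each Subject gets its own random intercept.
library(nlme)

fit <- lme(distance ~ age, random = ~ 1 | Subject, data = Orthodont)
summary(fit)  # shows between-subject and residual (within-subject) variation
```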