Generalized Additive Model – How to Interpret Concurvity and Wiggliness Induced by K Knots in GAM Models

Tags: concurvity, convergence, generalized-additive-model, multicollinearity

Two questions: 1. Is a concurvity of 1 for factors normal? 2. How do you interpret a partial effect when the effect is linear and the confidence interval lines are pinched at 0?

Is it okay for covariates in GAM models that are entered as factors (and random effects) to result in a concurvity value of 1? One factor has 2 levels (absence or presence of boulders) and the other has 6 levels (6 years). These two covariates are entered as main effects and as mixed effects, and the concurvity value for both is 1. Both factors appear to have a positive linear effect overall. Am I missing something blatantly obvious that means collinearity for these factors/categorical variables can be ignored?

This is how the covariates appear in the model:
gam(species ~ s(covariate1, bs = "cr") + s(covariate2, bs = "cr") + s(factor1, bs = "re") + s(factor2, bs = "re") + s(covariate2, factor2, bs = c("cr", "re")))

How do you interpret a partial effect that is linear and whose SE lines are pinched at 0? One could interpret this as a positive or negative linear relationship, but what else are the takeaways? In my model, convergence is reached and the K values appear fine according to gam.check(). Could the pinching be because the data are sparse, not patchy, or contain too many zeroes?

It looks like wiggles can be turned on or off, smoothed over or amplified, as K is increased or decreased. This also seems to vary depending on whether a high or low K is assigned to the main effect, the mixed effect, or both. For reporting and a tabulated comparison of models, it makes sense to go with the lower AIC, but for visual interpretation and to bring out any patchiness in the data, how far should one go in changing K? How does one avoid overfitting or underfitting? In some instances the choice of K makes a negligible difference, but if it can induce more wiggles, how should this be gauged?

My general approach is to set K as high as I can and then reduce it as needed. Should the K value be the same for a variable when it is entered as a main effect and also as a mixed effect?

I would like some reassurance, as I observed the following:

* When K=-1, the wiggle in the data was not captured until K was changed manually.

* Wiggles are observed when K for the main effect is high while K for the same variable as a mixed effect is low.

* Wiggles are not observed when K for the main effect is high and K for the same variable as a mixed effect is also high.

Part 2

Q1: I meant to write “ti”, as in: … + ti(covariate2, factor2, bs=c("cr","re"))

Q2: I may have misstated this; yes, I mean a mixed effects model (see below). What is the rule of thumb for modifying the K value of a covariate when it enters the model as both a fixed and a random effect? I can edit K for the fixed effect, but I am not sure what guidance to use to edit K for the random effect. See covariates 5 and 6 below.

                 gam(species ~
                       s(covariate1, bs = "cr", k = 10) +
                       s(covariate2, bs = "cr", k = 10) +
                       s(covariate3, bs = "cr", k = 10) +
                       s(covariate4, bs = "re", k = 10) +
                       s(covariate5, bs = "re", k = 10) +
                       s(covariate6, bs = "cr", k = 50) +
                       ti(covariate6, covariate5, bs = c("cr", "re"), k = c(20, 10), m = 1),
                     data = data,
                     family = nb(link = "log"),
                     method = "ML",
                     optimizer = c("outer", "bfgs"))  # may add select = TRUE

Best Answer

Q1: with factors that have a smallish number of levels, I believe that there will inevitably be concurvity in the parametric part of the model. I don't think this is a problem however.
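A minimal sketch of the Q1 situation on simulated data follows. The data frame and the names x, boulders, year and y are made up for illustration and are not the asker's data. It fits a count model with one smooth and two factors entered as bs = "re" terms, then prints the concurvity table so the entries for the factor terms and the para column can be inspected. One likely contributor to high values is that a random effect basis is a set of indicator columns that sum to the intercept column, so the constant is shared between the terms.

```r
library(mgcv)

set.seed(1)
n <- 300

# Hypothetical data: one continuous covariate, a 2-level and a 6-level factor
dat <- data.frame(
  x        = runif(n),
  boulders = factor(sample(c("absent", "present"), n, replace = TRUE)),
  year     = factor(sample(2015:2020, n, replace = TRUE))
)

# Small simulated year effects, indexed by the factor's integer codes
year_eff <- rnorm(6, sd = 0.3)
eta <- 0.5 + sin(2 * pi * dat$x) +
  0.3 * as.integer(dat$boulders == "present") +
  year_eff[dat$year]
dat$y <- rpois(n, lambda = exp(eta))

# Both factors enter as random effect smooths
m <- gam(y ~ s(x, bs = "cr") + s(boulders, bs = "re") + s(year, bs = "re"),
         data = dat, family = poisson, method = "REML")

# Concurvity of each term with the rest of the model
cc <- concurvity(m, full = TRUE)
round(cc, 3)
```

The rows are the three measures (worst, observed, estimate) and the columns are the parametric part followed by each smooth, so the factor terms can be read off directly.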

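Assuming the code you showed isn't pseudocode, you can't write a term such as s(covariate2, factor2, bs = c("cr", "re")), as s() doesn't allow for smooth interactions of this sort where you have separate marginal bases. Terms with a separate basis for each margin need a tensor product constructor such as te() or ti().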
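As a hedged illustration of an interaction with separate marginal bases, the sketch below uses ti() with a cubic regression spline margin for a continuous covariate and a random effect margin for a factor. The data are simulated and the names x, f and y are placeholders. It is one way to set up such a term, not necessarily the right structure for the asker's model.

```r
library(mgcv)

set.seed(2)
n <- 400

# Hypothetical data: smooth effect of x whose slope varies by level of f
dat <- data.frame(
  x = runif(n),
  f = factor(sample(letters[1:6], n, replace = TRUE))
)
slope <- rnorm(6, sd = 0.5)
dat$y <- sin(2 * pi * dat$x) + slope[dat$f] * dat$x + rnorm(n, sd = 0.3)

# Main effects via s(); interaction via ti() with one basis per margin
m_ti <- gam(y ~ s(x, bs = "cr") +
              s(f, bs = "re") +
              ti(x, f, bs = c("cr", "re")),
            data = dat, method = "REML")

summary(m_ti)
```

Using ti() rather than te() keeps the interaction separate from the two main effects, which is why s(x) and s(f) are also in the formula.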

Q2: with the usual definition of the uncertainty band, the band has to shrink to zero width at the point where the linear function crosses y = 0. This follows from the sum-to-zero identifiability constraint on each smooth: a smooth that is estimated as a straight line is pinned where it passes through zero. That linear functions do this is well known, and it leads to lower coverage than is implied by the $1 - \alpha$ coverage requested. A correction has been proposed by Marra and Wood and is implemented in {mgcv} (the seWithMean argument of plot.gam()); in essence it applies the uncertainty in the constant term (the intercept, or overall mean) to each estimated smooth in turn. This gives intervals with close to the correct nominal coverage and, in doing so, also fixes the "bow-tie" interval problem associated with smooth functions that are estimated to be linear.
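The pinching can be reproduced on simulated data. In this sketch (the variables x and y are invented for illustration) the true effect is linear, so the fitted smooth should be close to a straight line. The standard error of the smooth is then compared near the mean of x and at the largest x, and the partial effect is plotted with and without seWithMean = TRUE.

```r
library(mgcv)

set.seed(3)
n <- 200
x <- runif(n)
y <- 1 + 2 * x + rnorm(n, sd = 0.5)  # truly linear effect

m <- gam(y ~ s(x), method = "REML")

# Standard errors of the centred smooth at the observed x values
pr <- predict(m, type = "terms", se.fit = TRUE)
se <- pr$se.fit[, "s(x)"]

i_mid <- which.min(abs(x - mean(x)))  # observation closest to the mean of x
i_end <- which.max(x)                 # observation at the upper end of x
c(se_near_mean = unname(se[i_mid]), se_at_max = unname(se[i_end]))

# Default band (pinched) next to the band that includes intercept uncertainty
op <- par(mfrow = c(1, 2))
plot(m, main = "default")
plot(m, seWithMean = TRUE, main = "seWithMean = TRUE")
par(op)
```

The left panel should show the bow-tie shape, while the right panel should show a band that no longer collapses where the line crosses zero.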

k = -1 just means "use the entirely arbitrary default for this type of smooth". It is set to -1 because for s() you want the number of basis functions used by default to scale with the dimension of the smooth. For thin plate splines the documented default is 10 for a 1d smooth and 30 for a 2d smooth; the 5^d rule (hence 25 for a 2d smooth) is the default for tensor product smooths built with te() or ti().
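The defaults can be checked by counting coefficients in fitted models, and k.check() (the basis dimension check that gam.check() also reports) gives a rough guide to whether a chosen k is too small. This is a sketch on simulated data; dat, x, z, y and the helper n_coef() are hypothetical names introduced here.

```r
library(mgcv)

set.seed(4)
n <- 300
dat <- data.frame(x = runif(n), z = runif(n))
dat$y <- sin(2 * pi * dat$x) + cos(2 * pi * dat$z) + rnorm(n, sd = 0.3)

# Helper (hypothetical): number of coefficients, intercept included
n_coef <- function(fml) length(coef(gam(fml, data = dat, method = "REML")))

k_1d <- n_coef(y ~ s(x))      # 1d thin plate spline, default k
k_2d <- n_coef(y ~ s(x, z))   # 2d thin plate spline, default k
k_te <- n_coef(y ~ te(x, z))  # tensor product, default k per margin
c(s_1d = k_1d, s_2d = k_2d, te_2d = k_te)

# Basis dimension check for a model with two 1d smooths
m <- gam(y ~ s(x) + s(z), data = dat, method = "REML")
kc <- k.check(m)
kc
```

Each smooth loses one coefficient to the identifiability constraint, and the intercept adds one back, so the totals equal the default k in these single-smooth models. In the k.check() output, an edf close to k' together with a low p-value is the usual hint to refit with a larger k.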

I'm not sure I follow what you mean by a "mixed effect"? You might have a model that contains both random and fixed effects, in which case some might call the model a "mixed effects model" but I'm struggling to think of situations where a single covariate would enter a model as a "mixed effect".
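On the Part 2 question of which k to give a random effect term: as far as I am aware, the "re" basis sets its dimension from the number of levels of the factor, so a k supplied to s(..., bs = "re") has no effect. The sketch below checks this on simulated data by counting the coefficients of the random effect term. The names f, x and y are placeholders, and the behaviour is worth confirming against the mgcv version in use.

```r
library(mgcv)

set.seed(5)
n <- 300

# Hypothetical data: a 6-level grouping factor and one continuous covariate
dat <- data.frame(
  x = runif(n),
  f = factor(sample(letters[1:6], n, replace = TRUE))
)
f_eff <- rnorm(6, sd = 0.5)
dat$y <- sin(2 * pi * dat$x) + f_eff[dat$f] + rnorm(n, sd = 0.3)

# k = 10 is requested for the random effect, but f has only 6 levels
m_re <- gam(y ~ s(x, bs = "cr", k = 10) + s(f, bs = "re", k = 10),
            data = dat, method = "REML")

# Count the coefficients that belong to the random effect term
n_re <- sum(grepl("^s\\(f\\)", names(coef(m_re))))
n_re
```

If the count matches the number of factor levels rather than the requested k, only the k of the continuous margin needs tuning in terms that mix a spline with a random effect.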

Related Solutions

Generalized Additive Model – Understanding GAM Smoother vs Parametric Term and Concurvity Difference

The concurvity moves from the stated smooth terms to the parametric terms, which concurvity() groups together under the para column of the matrix or matrices returned.

Here's a modified example from ?concurvity

library("mgcv")
## simulate data with concurvity...
set.seed(8)
n<- 200
f2 <- function(x) 0.2 * x^11 * (10 * (1 - x))^6 + 10 *
            (10 * x)^3 * (1 - x)^10
t <- sort(runif(n)) ## first covariate
## make covariate x a smooth function of t + noise...
x <- f2(t) + rnorm(n)*3
## simulate response dependent on t and x...
y <- sin(4*pi*t) + exp(x/20) + rnorm(n)*.3

## fit model...
b <- gam(y ~ s(t,k=15) + s(x,k=15), method="REML")

Now add a linear term and refit

x2 <- seq_len(n) + rnorm(n)*3
b2 <- update(b, . ~ . + x2)

Now look at the concurvity of the two models

## assess concurvity between each term and `rest of model'...
concurvity(b)
concurvity(b2)

These produce

> concurvity(b)
                para       s(t)      s(x)
worst    1.06587e-24 0.60269087 0.6026909
observed 1.06587e-24 0.09576829 0.5728602
estimate 1.06587e-24 0.24513981 0.4659564
> concurvity(b2)
              para      s(t)      s(x)
worst    0.9990068 0.9970541 0.6042295
observed 0.9990068 0.7866776 0.5733337
estimate 0.9990068 0.9111690 0.4668871

Note that x2 is essentially a noisy version of t:

> cor(t, x2)
[1] 0.9975977

and hence the concurvity has gone up from essentially 0 in b to almost 1 in b2.

Now if we add x2 as a smooth function instead...

concurvity(update(b, . ~ . + s(x2)))

we see that the para entries return to being very small and we get a measure for the spline term s(x2) directly

> concurvity(update(b, . ~ . + s(x2)))
                 para      s(t)      s(x)     s(x2)
worst    1.506201e-24 0.9977153 0.6264654 0.9976988
observed 1.506201e-24 0.9838018 0.5893737 0.9963857
estimate 1.506201e-24 0.9909506 0.4921592 0.9943990

This is just how the function works in terms of the parametric terms; the focus is on the smooth terms.

Note: ~~you are specifying gamma but fitting using REML. gamma only affects GCV and UBRE/AIC methods of smoothness selection, so you can remove this argument as it is having zero effect on the model fits.~~ From version 1.8-23 of mgcv, the gamma argument now also affects models fitted using REML/ML, where smoothness parameters are selected by REML/ML as if the sample size was $n/\gamma$ instead of $n$.

Generalized Additive Model – Concurvity and GAMMs Analysis in R Using MGCV

This was the answer I found:

One is doing X1 = f(X2) + f(X3) + f(X4), and the other is doing X1 = f(X2), X1 = f(X3), X1 = f(X4), etc.

The first one, with full = TRUE, is similar to the VIF: each term is compared with all the other terms in the model together. The second one, with full = FALSE, is like the Pearson correlation: it makes pairwise comparisons. They are NOT supposed to match. High concurvity (= 1.0) for parametric factors with few levels doesn't matter, and other variables with high-ish concurvity can be taken out of the model.
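The two modes can be compared side by side. In this sketch on simulated data (u, x, z and y are hypothetical names), x is built as a noisy function of u so that those two terms overlap, while z is independent of both.

```r
library(mgcv)

set.seed(8)
n <- 200
u <- sort(runif(n))
x <- u^2 + rnorm(n, sd = 0.1)  # x depends on u, so s(u) and s(x) overlap
z <- runif(n)                  # unrelated covariate
y <- sin(4 * pi * u) + x + z + rnorm(n, sd = 0.3)

b <- gam(y ~ s(u) + s(x) + s(z), method = "REML")

# Each term against all the other terms together: one matrix
full_cc <- concurvity(b, full = TRUE)
round(full_cc, 2)

# Each term against each other term: a list of pairwise matrices
pair_cc <- concurvity(b, full = FALSE)
round(pair_cc$worst, 2)
```

With full = TRUE there is one column per term, whereas full = FALSE returns square term-by-term matrices, which shows which specific pair is responsible for a high overall value.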
