Solved – Calculating the Log Likelihood of models in glmnet

Tags: deviance, glmnet, likelihood

glmnet() returns a lambda sequence fitobj$lambda, and I would like to calculate the log-likelihood of the models (LL_model) defined by that lambda sequence.

The obvious solution is to just take the parameters and calculate the LL of each model manually. However, this is not very elegant and it is very slow. Therefore, I am trying to calculate LL_model from the deviance measures that glmnet returns.

The glmnet object gives me:

1) nulldev = 2*(LL_sat - LL_null), where LL_sat is the log-likelihood of the saturated model and LL_null that of the NULL model (one value)

2) dev.ratio = 1 – dev.model/nulldev, where dev.model is the deviance of the model at hand (k values, for k lambda values/models)

3) and glmnet:::deviance.glmnet gives me dev.model = (1 - dev.ratio)*nulldev, where dev.model = 2*(LL_sat - LL_model) (k values, for k lambda values/models)

To calculate the LL for each model, I would do the following:

1) Calculate LL_null

2) Solve (1) for LL_sat and calculate LL_sat (one value)

3) Solve (3) for LL_model and calculate LL_model (a vector of k values)
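For concreteness, here is a minimal sketch of these three steps for a binomial fit, following the identities (1)–(3) above. It assumes fitobj is a glmnet fit with family = "binomial" and y is the 0/1 response vector (both names are placeholders):

  # Step 1: LL_null from the intercept-only model (fitted probability = mean(y))
  p_null  <- mean(y)
  LL_null <- sum(y * log(p_null) + (1 - y) * log(1 - p_null))

  # Step 2: solve nulldev = 2*(LL_sat - LL_null) for LL_sat
  LL_sat <- fitobj$nulldev / 2 + LL_null

  # Step 3: solve dev.model = 2*(LL_sat - LL_model) for LL_model
  dev_model <- (1 - fitobj$dev.ratio) * fitobj$nulldev  # same as deviance(fitobj)
  LL_model  <- LL_sat - dev_model / 2                   # one value per lambda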

Now, my two questions are:

1) How is this NULL model defined? The glmnet manual says "The NULL model refers to the intercept model." But I am a bit puzzled by the fact that there is only one NULL model and one nulldev for the whole lambda sequence. For which lambda is this intercept model calculated? I have the feeling I am missing something.

2) Does anybody see an easier way to calculate the LL of each model in the sequence? The final goal is to calculate the (E)BIC of each model. I am surprised that this turns out to be so tedious.
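For reference, once LL_model is available, a BIC per lambda could be computed along these lines (a sketch; it assumes fitobj$df, the number of nonzero coefficients that glmnet reports per lambda, as the model dimension, and n as the sample size):

  n   <- length(y)                           # sample size
  BIC <- -2 * LL_model + fitobj$df * log(n)  # one BIC per lambda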

Any help would be greatly appreciated!

Best Answer

Here is one sure way to compute the log-likelihood of logistic (and probit) regressions, no matter how the model is estimated. All you need are the dependent dummy variable and the fitted values.

I tested the function on a model with just a constant and it seems to work well. It is NOT that hard to compute a log-likelihood -- so just write it down and compute it by hand. Then always use the same function everywhere and your computations will be consistent.

I really don't know why logLik is not yet compatible with glmnet. Penalized pseudo-R2 is an often-used measure of fit, you need the log-likelihood for information criteria, and it is simply convenient to have that value for computing plenty of test statistics. Anyway, here it is for future reference:

 logit_logLik <- function(dummy, fitted_values) {
   # Description: Computes the log-likelihood of a fitted
   # logit (or probit) model from the 0/1 response and the
   # fitted probabilities

   # Format variables
   y <- as.matrix(dummy)
   p <- as.matrix(fitted_values)

   # Adjust dimensions: drop leading observations of y that have
   # no fitted value. Note the guard: when skip is 0, y[-(1:0), ]
   # would wrongly drop the first row.
   skip <- nrow(y) - nrow(p)
   if (skip > 0) y <- as.matrix(y[-seq_len(skip), ])

   # Compute the Bernoulli log-likelihood (vectorised)
   sum(y * log(p) + (1 - y) * log(1 - p))
 }
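A hypothetical usage with a binomial glmnet fit, where x, y, and fitobj are placeholder names and predict(..., type = "response") returns fitted probabilities:

  # Fitted probabilities at one lambda, then the log-likelihood
  p_hat <- predict(fitobj, newx = x, s = fitobj$lambda[10], type = "response")
  ll    <- logit_logLik(y, p_hat)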