Poisson Regression – Examining if Zero-Truncated Poisson and Basic Poisson Models are Nested

poisson-regressionzero inflation

I've seen plenty that discusses whether a basic Poisson regression is a nested version of a zero-inflated Poisson regression. For instance this site argues that it is, since the latter includes extra parameters to model additional zeroes, but otherwise includes the same Poisson regression parameters as the former, though the page does include a reference that disagrees.

What I can't find information about is whether a zero-truncated Poisson and a basic Poisson are nested. If the zero-truncated Poisson is just a Poisson with the extra stipulation that the probability of a zero count is zero, then I guess it sounds like they could be, but I was hoping for a more definitive answer.

The reason I'm wondering is that it will affect whether I should use Vuong's test (for non-nested models), or a more basic chi-square test based on the difference in loglikelihoods (for nested models).

Wilson (2015) talks about whether a Vuong test is appropriate for comparing the zero-inflated regression with the basic one, but I can't find a source that discusses zero-truncated data.

Best Answer

Just come across this now. To avoid confusion, I am the Wilson of Wilson(2015) referenced in the original question, which asks whether the Poisson and truncated Poisson models are nested, non nested etc. Slightly simplifying, a smaller model is nested in a larger model if the larger model reduces to the smaller one if a subset of its parameters are fixed at stated values; two models are overlapping if they both reduce to the same model when subsets of their respective parameters are fixed to certain values, they are non-nested if no matter how parameters are fixed one cannot reduce to the other. According to this definition the truncated Poisson and standard Poisson are non-nested. HOWEVER, and this is a point that seems to have been overlooked by many, Vuong's distributional theory refers to STRICTLY nested, STRICTLY non-nested, and STRICTLY overlapping. "STRICTLY" referring to the addition of six restrictions to the basic definition of nested etc. These restrictions are not exactly simple, but they do, among other things, mean that Vuong's results about the distribution of log likelihood ratios are not applicable in cases where models/distributions are nested at a boundary of a parameter space (as is the case with Poisson/zero inflated Poisson with an identity link for the zero-inflation parameter) or when one model tends to the other when a parameter tends to infinity, as is the case with the Poisson/zero-inflated Poisson when a logit link is used to model the zero-inflation parameter. Vuong advances no theory about the distribution of log likelihood ratios in these circumstances. Unfortunately here, this is the case with Poisson and truncated Poisson distributions, one tends to the other as the parameter tends to infinty, to see this, note that the ratio of the pmfs of Poisson and truncated Poisson distributions is 1-exp(-lambda) which tends to 1 as lambda tends to infinity, thus the two distributions are not stricty non-nested, or strictly anything for that matter, and Voung's theory is not applicable.

The following R code will simulate the distribution of poisson and truncated Poisson loglikelihood ratios. It requires the VGAM package.

n<-30   
lambda1<-1
H<-rep(999,10000)
for(i in 1:10000){
  print(i)
  y<-rpospois(n, lambda1)
  fit1 <- vglm(y ~ 1, pospoisson)
  fit2<-glm(y~1, family=poisson(link="log"))
  H[i]<-logLik(fit1)-logLik(fit2)
}

hist(H,col="lemonchiffon")