Predictions – Subject Specific vs. Population Average Predictions

interpretation, marginal-model, mixed model

I am unsure whether my thesis should report the subject-specific predictions of the probability of responding with an 'I don't know' answer, or the population-average predictions. Consider for example the following graphs (I have not managed to make them smaller yet):

[Two figures: predicted probability of an 'I don't know' answer by gender, one subject-specific and one population-average]

Both lead to the same conclusion, that the probability doubles for females. In terms of frequencies, however, this means that on average in the population I will have 3.3% missingness for females and 2.5% for males. For the average female individual (in the median clusters, i.e. interviewed by the median interviewer in the median country), I will have 1.9% missingness, and 1.5% for the average male individual. I think the population average is more informative for my purpose, which is to evaluate whether missingness is related to gender and whether the rates are problematic. However, I then read the following passage (Allison, P D (2009) Fixed Effects Regression Models, Quantitative Applications in the Social Sciences Series, Sage, Thousand Oaks, California):

“if you are a doctor and you want an estimate of how much a statin
drug will lower your patient’s odds of getting a heart attack, the
subject-specific coefficient is the clear choice. On the other hand,
if you are a state health official and you want to know how the number
of people who die of heart attacks would change if everyone in the
at-risk population took the statin drug, you would probably want to use
the population–averaged coefficients. Even in the public health case,
it could be argued that the subject-specific coefficient is more
fundamental.”

I can't seem to translate this to my specific situation. Anyone have any advice on my situation?

Best Answer

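You cannot reach the conclusion that "the probability doubles for females". By your own numbers, 3.3% versus 2.5% is a ratio of about 1.3, and 1.9% versus 1.5% is also about 1.3, so neither set of predictions shows a doubling. Your figures are a little misleading because the $y$-axes do not start from zero, which exaggerates the gap. It also seems that the percentages are just approximations read off the figures.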

We can always interpret a regression coefficient as the association between $X$ and $Y$ on average. The two kinds of coefficient differ in what is held fixed:

  • The subject-specific coefficient is the association between $X$ and $Y$ holding the random effects constant. It describes how an individual's probability of missingness depends on gender within a given cluster. There is no concept of an "average female/male individual" here: a prediction with the random effects set to zero (or to their median) refers to a typical cluster, not to a typical person.
  • The population-average coefficient is the association between $X$ and $Y$ after averaging over the random effects, rather than the association for any individual subject. Given your aim, which is to assess the overall missingness rates by gender, the population-average estimate fits your need.
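To make the distinction concrete, here is a minimal sketch of how the two kinds of predicted probability could be computed. It assumes a logistic model with a single normal random intercept, which is a simplification of your interviewer-within-country structure. The fixed effects are back-calculated from your reported 1.5% and 1.9%, and the random-intercept standard deviation `sigma = 1.0` is purely hypothetical; the function names are mine, not from any package.

```python
import numpy as np
from numpy.polynomial.hermite import hermgauss


def expit(x):
    """Inverse logit: maps a linear predictor to a probability."""
    return 1.0 / (1.0 + np.exp(-x))


def logit(p):
    """Log-odds of a probability."""
    return np.log(p / (1.0 - p))


def subject_specific_prob(b0, b1, female, u=0.0):
    """Probability for a subject in a cluster with random intercept u.

    u = 0 corresponds to the median cluster.
    """
    return expit(b0 + b1 * female + u)


def population_average_prob(b0, b1, female, sigma, n_nodes=40):
    """Probability averaged over u ~ N(0, sigma^2).

    Uses Gauss-Hermite quadrature:
    E[f(u)] = (1 / sqrt(pi)) * sum_i w_i * f(sqrt(2) * sigma * x_i).
    """
    nodes, weights = hermgauss(n_nodes)
    u = np.sqrt(2.0) * sigma * nodes
    probs = expit(b0 + b1 * female + u)
    return float(np.sum(weights * probs) / np.sqrt(np.pi))


# Hypothetical parameter values, chosen so that the median-cluster
# predictions match the 1.5% (male) and 1.9% (female) in the question.
b0 = logit(0.015)
b1 = logit(0.019) - b0
sigma = 1.0  # assumed random-intercept SD, not taken from the question

for female, label in [(0, "male"), (1, "female")]:
    ss = subject_specific_prob(b0, b1, female)
    pa = population_average_prob(b0, b1, female, sigma)
    print(f"{label}: subject-specific (u = 0) = {ss:.4f}, "
          f"population-average = {pa:.4f}")
```

Because the baseline probabilities are small, averaging over the random intercept pulls the predictions upward, which is the same direction as the gap between your 1.5%/1.9% and 2.5%/3.3%. With `sigma = 0` the two predictions coincide.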