Solved – Linear regression or poisson regression for predicting incidence rates

linearpoisson-regressionregression

I am trying to predict incidence rates of prostate cancer based on different age groups and time (in years). I am not sure which regression method to take. Some have suggested linear regression while others suggest poisson regression as I am dealing with counts (incidence rates).
Appreciate your feedback! πŸ™‚

Best Answer

Poisson regression is likely what you are looking for because it models count data which is what you have.

One reason this is better than linear regression is because linear regression assumes errors are normally distributed around the mean and consequently your model allows negative counts. For example, if linear regression predicts an expected incidence rate of 0 then your model says an incidence rate of 1 is just as likely as -1. This is especially pertinent to you since your counts are incidence rates of prostate cancer and they will presumably be small.

As an additional note, if your data has lots of zeros look into zero inflated Poisson models. A lot of people don't develop prostate cancer and I wonder if your data includes those people.

Related Question