Solved – Statistical test in R for count data

anovabiostatisticscount-datapoisson distributionr

I am struggling with using the correct test for count data in R. The dataset I have is the number of nymphs produced by three aphid species on wheat.

So this is count data with one response variable (number of nymphs) and one explanatory variable with three levels (i.e. three different species).

I want to test whether there are any differences between the three species with the number of nymphs they produce.

I think that I should not use an ANOVA with count data. I have tried a GLM with QuasiPoisson, but I am struggling to interpret the output and be sure that I have used the correct test. Essentially I want to equivalent for a one-way anova but for count data.

Best Answer

I suggest two consideration before modelling.

  1. Are you or not comparing fecundities of species through their lifetime? Poisson distribution might not be the case here if you do compare in this way, because species might have different lifespan on the same host plant, Poisson however refer to occurrence of events at the same time interval. Non-parametric comparison can be used here but please try to have enough replicates because generally non-parametric methods are more conservative.

  2. If hypothesis on Poisson is met here, it will be better to check whether, overall, the average value of response variable equal to their dispersion. You might consider to use "Negative binomial regression" if your data is over-dispersed.

Related Question