Solved – Minimum population size for chi-squared test

chi-squared-testsample-size

I'm analyzing data from an experiment in which two independent groups were exposed to an experimental setup without and with treatment.

I am testing whether the treatment changed the second group's behaviour by performing a chi-squared test that compares group 2 (the observed) vs group 1 (the expected). The result indicates there is a significant change in behaviour X² p-value < 0.00014.

Now, I am trying to test subgroups to understand better the change, i.e., looking at gender, age, and other self reported metrics.

My question is, given that group 2 N=40 if I look at age for instance I find people in their 20s and their 60s show significant change but other age groups don't. However people in their 20s N=12 and people in their 60s N=5. Is there a heuristic / rule that says there is a minimum number of people needed to consider a result significant? For instance anything below N=5 cannot be considered significant or anything below N=20% of the population?

EDIT: Just to clarify, I am doing a chi-squared test of independence (between group 1&2) not a chi-square goodness of fit test.

EDIT 2: With this edit I consider the question closed. None of the answers / comments gave me a definitive solution, which I believe says more about the question than the answers. I was hoping for a definitive answer along the lines you need at least 5 people or 20% of your sample. It seems the answer is less direct as it is sensitive to many factors.

Best Answer

For small sample sizes, use Fisher's exact test, because the $\chi^2$ test sampling statistics has only approximately the $\chi^2$ distribution, and this approximation is problematic for small sample sizes.

While lower sample size decreases the power of the test, the p-values (and not the sample size) are indicators of the statistical significance. A significant p-value stays significant whatever the sample size; the sample size has been taken care of through the calculation of the test statistic.

However, someone might claim that a small sample size is more likely to be biased. This is not necessarily true, but I think there might exist a correlation between the study sample size and whether the data was collected in an unbiased way as it should.

Related Question