Solved – Bootstrapping and comparing mean distributions

bootstrapstatistical significancet-test

Is the following a reasonable approach to assess the statistical significance of the difference between two groups'

For each group
1) Subsample with replacement
2) Take the mean of the subsample
3) Repeat 10,000 times to build a distribution of means
4) Carry out a t-test to assess the difference between those two distributions

(i.e. bootstrapping to build a distribution of means)

The two datasets are very different in size (~100 vs. 100,000). The alternative approach would be to subsample from each to build two equally sized datasets, and then use a t-test on those two samples. The problem I have with this is I'm not sure if the smaller of the two sets is normally distributed (while the larger is), which may invalidate the t-test assumptions?

Best Answer

This is not how you would do a simulation test (not a bootstrap test here). What you want to do mix all the data together and then randomly redivide into two new groups find the mean of each group take the difference and plot it. Repeat lots of times, 10,000 for instance. Then you can find a p-value but counting all the results as or more extreme than your observed result (the original difference in means) and divide by 10000. This is a non-parametric version of a t-test called a permutation test. However, you could use a t-test for difference in means but there are more assumptions about the data than this test.