Solved – Variance of Mean of Samples from Unknown Distribution

unbiased-estimatorvariance

I have a bunch of IID samples from a random variable with unknown mean and unknown variance. I now need to know the variance of the average of these samples.

I found some references. In decreasing order of helpfulness: 1, 2, 3. These give a formula for the variance in terms of the variance of the original distribution. What I haven't been able to find is a direct solution for the case of the original distribution having unknown variance. This similar question may be of use, but I am unsure how to apply it.

My first thought is to try to use the sample variance as an estimate for the population variance. My question is, is that the right way to do it? Is so, how do I correct for any differences? If not, what is the right way?

Best Answer

The question you think it might be "similar", is not, because it is concerned mainly with the distribution/variance of the sample variance itself.

You cannot know the variance of the sample mean (i.e. of the "average of these samples" - I guess by "sample" you mean a single observation) because it depends on a usually unknown parameter (the population variance). So indeed, what you can do is use in its place the estimated variance, i.e. the sample variance, corrected for bias:

$$s^2 = \frac1{n-1} \sum(x_i-\bar x)^2, \;\; E(s^2) = \sigma^2$$

As for the question

How do I correct for any differences?

The answer is "you cannot" -and this is the whole point of statistics. If you could "correct for any differences", it would imply that you already knew the true variance, isn't it so?