Solved – Estimating the mean of a Normal with unknown variance, then predicting a future observation

bayesian, normal distribution, probability, self-study, t-distribution

I am trying to estimate the population mean from 9 observations when the variance is unknown. I marginalized the posterior and understand that a t-distribution gives me the posterior distribution of the population mean. I am stuck at this point. Normally, if I had to estimate something, I would generate 1000 or more random samples from the given distribution and then compute point or interval estimates from them. But the t-distribution has confused me. Matlab's tpdf returns only 8 values for my input, but when I sum them up they do not add up to 1, which looks weird, so is it generating actual samples? If these are actual values, then where is the distribution? How do I estimate the mean from it (substitute these values into the standardization formula to find values of the mean?).

PS: I have been studying stats recently, and though I understand the mathematical part of it, I feel miserable when doing simulation in Matlab. So I would appreciate any pointers towards learning the computational side of it.

EDIT: I understand the mathematical or derivation part of it. It is the computational simulation that confuses me. I use tpdf for the t distribution, but it needs data and the degrees of freedom, and then how do I go about finding the point estimate of the mean in Matlab? Also, tpdf needs to be translated to my data values.
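A likely source of the confusion: tpdf evaluates the t density at the points you pass in, so it returns one density value per input point (heights of the curve, not draws, which is why they need not sum to 1); Matlab's sampling counterpart is trnd. A minimal Python sketch of the distinction (the grid of 8 points is illustrative, echoing the 8 values mentioned above):

```python
import math
import random

random.seed(0)

def t_pdf(x, nu):
    """Density of the standard t distribution with nu degrees of freedom."""
    c = math.gamma((nu + 1) / 2) / (math.sqrt(nu * math.pi) * math.gamma(nu / 2))
    return c * (1 + x * x / nu) ** (-(nu + 1) / 2)

# Evaluating the pdf at 8 grid points gives 8 density values, not 8 samples;
# only the *area* under the curve integrates to 1, not these heights.
heights = [t_pdf(x, 8) for x in range(-4, 4)]

# To actually *sample* from a t distribution, draw Z ~ N(0,1) and
# V ~ chi^2_nu, then form Z / sqrt(V / nu).
def t_sample(nu):
    z = random.gauss(0.0, 1.0)
    v = random.gammavariate(nu / 2, 2.0)  # chi^2_nu is Gamma(nu/2, scale=2)
    return z / math.sqrt(v / nu)

draws = [t_sample(8) for _ in range(1000)]
```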

Best Answer

Quoting from our book Bayesian Essentials with R:

if $\mathscr{D}_n$ denotes a normal $\mathscr{N}\left(\mu,\sigma^{2}\right)$ sample of size $n$, if $\mu$ has a prior equal to a $\mathscr{N}\left(0,\sigma^{2}\right)$ distribution, and $\sigma^{-2}$ an exponential $\mathscr{E}(1)$ distribution, the posterior is given by
\begin{align*}
\pi((\mu,\sigma^2)|\mathscr{D}_n) &\propto \pi(\sigma^2)\times\pi(\mu|\sigma^2)\times f(\mathscr{D}_n|\mu,\sigma^2)\\
&\propto (\sigma^{-2})^{1/2+2}\, \exp\left\{-(\mu^2 + 2)/2\sigma^2\right\}\\
&\quad\times (\sigma^{-2})^{n/2}\,\exp \left\{-\left(n(\mu-\overline{x})^2 + s^2 \right)/2\sigma^2\right\} \\
&\propto (\sigma^2)^{-(n+5)/2}\exp\left\{-\left[(n+1) (\mu-n\bar x/(n+1))^2+(2+s^2)\right]/2\sigma^2\right\}\\
&\propto (\sigma^2)^{-1/2}\exp\left\{-(n+1)[\mu-n\bar x/(n+1)]^2/2\sigma^2\right\}\\
&\quad\times (\sigma^2)^{-(n+2)/2-1}\exp\left\{-(2+s^2)/2\sigma^2\right\}\,.
\end{align*}

Therefore, the posterior on $(\mu,\sigma^2)$ can be decomposed as the product of an inverse gamma distribution on $\sigma^2$, $$\mathscr{IG}((n+2)/2,[2+s^2]/2)\,,$$ which is the distribution of the inverse of a gamma $$\mathscr{G}((n+2)/2,[2+s^2]/2)$$ random variable, and, conditionally on $\sigma^2$, a normal distribution on $\mu$, $$\mathscr{N} (n\bar x/(n+1),\sigma^2/(n+1))\,.$$

The marginal posterior in $\mu$ is then a Student's $t$ distribution $$ \mu|\mathscr{D}_n \sim \mathscr{T}\left(n+2,n\bar x\big/(n+1),(2+s^2)\big/(n+1)(n+2)\right)\,, $$ with $n+2$ degrees of freedom, a location parameter proportional to $\bar x$ and a scale parameter almost proportional to $s$.
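The decomposition above translates directly into a two-step simulation: draw $\sigma^2$ from the inverse gamma, then draw $\mu$ from the conditional normal. A Python sketch under this posterior, with made-up data standing in for the 9 observations (here $s^2$ is the sum of squared deviations, matching its role in the likelihood exponent):

```python
import math
import random

random.seed(1)

# Hypothetical data: n = 9 observations, as in the question.
data = [4.2, 5.1, 3.8, 4.9, 5.5, 4.0, 4.7, 5.2, 4.4]
n = len(data)
xbar = sum(data) / n
s2 = sum((x - xbar) ** 2 for x in data)  # s^2 = sum of squared deviations

# One draw of (mu, sigma^2) from the posterior:
#   sigma^2 ~ IG((n+2)/2, (2+s^2)/2), i.e. 1/sigma^2 ~ Gamma((n+2)/2, rate=(2+s^2)/2)
#   mu | sigma^2 ~ N(n*xbar/(n+1), sigma^2/(n+1))
def posterior_draw():
    precision = random.gammavariate((n + 2) / 2, 2.0 / (2 + s2))  # scale = 1/rate
    sigma2 = 1.0 / precision
    mu = random.gauss(n * xbar / (n + 1), math.sqrt(sigma2 / (n + 1)))
    return mu, sigma2

draws = [posterior_draw() for _ in range(10000)]
mus = [m for m, _ in draws]
```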

From this distribution, you get the expectation $n\bar x/(n+1)$, which acts as your point estimator of $\mu$, and a credible interval on $\mu$, $$\left(n\bar x/(n+1)-((2+s^2)/(n+1)(n+2))^{1/2}q_{n+2}(\alpha),n\bar x/(n+1)+((2+s^2)/(n+1)(n+2))^{1/2}q_{n+2}(\alpha)\right)$$ where $q_{n+2}(\alpha)$ is the appropriate quantile of the standard $t_{n+2}$ distribution.
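Following the simulation workflow the asker prefers, the point estimate and a 95% credible interval can also be read off Monte Carlo draws from this Student's $t$ (location $n\bar x/(n+1)$, scale $((2+s^2)/(n+1)(n+2))^{1/2}$, $n+2$ degrees of freedom). A Python sketch, with hypothetical summary numbers as placeholders:

```python
import math
import random

random.seed(2)

# Hypothetical summaries of 9 observations; plug in your own xbar and s2.
n = 9
xbar, s2 = 4.6444, 2.702
loc = n * xbar / (n + 1)
scale = math.sqrt((2 + s2) / ((n + 1) * (n + 2)))
nu = n + 2

def t_sample(nu):
    """One draw from a standard t_nu: Z / sqrt(V/nu), V ~ chi^2_nu."""
    z = random.gauss(0.0, 1.0)
    v = random.gammavariate(nu / 2, 2.0)  # chi^2_nu is Gamma(nu/2, scale=2)
    return z / math.sqrt(v / nu)

# Shift and rescale standard t draws to the posterior of mu.
mus = sorted(loc + scale * t_sample(nu) for _ in range(20000))

point_estimate = loc  # posterior mean n*xbar/(n+1)
lo, hi = mus[int(0.025 * len(mus))], mus[int(0.975 * len(mus))]  # 95% interval
```

With exact quantiles instead of Monte Carlo, the same interval is $loc \pm scale \cdot q_{n+2}(\alpha)$, as in the formula above.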