Conditional Probability – Calculating Conditional Expectation of Gamma Distribution on Sum

conditional probabilitygamma distribution

I have a process which consists of a number of events and what is known is the timings between the events. What I'm trying to determine is a distribution that allows me to determine a likelyhood that a new sample fits the distibution.

The issue is mainly that if you have lots of samples you can approximate the result using a standard gaussian and use the mean and standard deviation. But if you only have a handful of samples, the gaussian does not accurately represent the situation.

From what I've read it is common to model waiting times using the gamma distribution. Looking at how the process evolves it looks like it matches well. The unknown is the scale parameter, since the shape parameter I think should be the number of samples. What I've worked out so far is that given the timings $X_1 … X_N$ you can say:

$$ \sum_{n=1}^N X_n \sim \Gamma(N,\theta) $$

($N$ is known and fixed)

However, $\theta$ is unknown, but the maximum likelihood parameter is the average of the $X_i$ (according to wikipedia anyway).

My question is, can I use this to estimate a distribution for $X_i$, that is, since the $X_i$ are independent:

$$ N X_i | \sum_n X_n \sim \Gamma(N, \tfrac{1}{N}\sum_n X_n) $$

Something else I've wondered about. Suppose I do have information about $\theta$, say a distribution. How can I incorporate this into the model?

Edit: Clarified that N is fixed.

Best Answer

If you have a Poisson process, then each $X_i$ has an exponential distribution (and their sum a Gamma distribution).

You can then use Bayesian methods to look at possible values of $\theta$. For example, the conjugate prior for an exponential distribution is also a Gamma distribution, which you may find helpful incorporating your prior information and observed data to get a credible interval for $\theta$.

Related Question