[Math] Conditioning on a random variable

conditional probabilityprobabilitystochastic-integrals

The number of storms in the upcoming rainy season is Poisson distributed but with a parameter value that is uniformly distributed between (0,5). That is Λ is uniformly distributed over (0,5), and given Λ = λ, the number of storms is Poisson with mean λ. Find the probability there are at least three storms this season


Ok, I feel like I can get started with this problem, but I'm having a really difficult time wrapping my head around the idea of conditioning on a random variable. and I would appreciate if somebody could explain to me, in simple terms, what is going on when we condition on a random variable. I haven't been able to find a good explanation of this concept and I've seen this problem done both by taking the expectation of $\Lambda$ and placing it into the PMF of the Poisson distribution and by multiplying the Poisson the PDF of $\Lambda$, $\frac{1}{5}$ over the interval [0,5] and integrating the product from 0 to 5 w.r.t. $\lambda$.

Anyway, here goes nothing

Obviously, the problem starts out quite simple:

Let $X =$ number of storms in the season
Let $\Lambda =$ the rate of the Poisson distribution, which varies as a uniform r.v. over [0,5].

We want the following:

$Pr(X \geq 3|\Lambda = \lambda) = 1- Pr(X < 3|\Lambda = \lambda) $

And then of course we can count down to zero storms in a season in a summation. But that's where I get lost. I don't know how to handle the random parameter and more-so how to handle these type of problems in general and I'd appreciate some feedback. Please note that I am not looking to solve this problem using the definition of conditional probability. I am trying to solve this by conditioning on $\Lambda$, that is, treating it as a constant.

Please note that this is NOT homework but rather for independent study.

Best Answer

I hope I am not repeating concepts you are already familiar with.

Roughly speaking, a random variable can take one of many values. Which exact value will appear as the outcome of your experiment is not known in advance to you. The probability distribution gives the frequency of these values if you were to perform a large number of experiments.

When you condition on a random variable, you fix a particular value for that variable and do the calculations you want. However, you are not supposed to know the value (otherwise, it will not be a random variable!), so you uncondition to get what you will get after a large number of experiments.

In your example, you first condition on $\Lambda$ taking the value of $\lambda$, and then compute the probability of three or more storms using $$ P(X\geq 3| \Lambda = \lambda). $$ Since $\Lambda$ is a random variable fixing it to one particular $\lambda$ does not give you the whole picture because $\Lambda$ can take any value between $0$ and $5$. Some of these values may appear more often than others, and you need to take this into account. And, this is done by unconditioning on $\Lambda$. That is, $$ P(X\geq 3) = \int_0^5 P(X\geq 3|\Lambda = \lambda)f_\Lambda(\lambda)d\lambda $$