Probability – Solving a Problem on Polya’s Urn Scheme

polya-urn-modelprobability

In an urn with $b$ blue and $r$ red balls, each time (call it a trial) a ball is chosen at random and then put again in the urn along with $c$ extra balls of the same color. Now probability of getting a blue ball in the 1st trial = $\frac{b}{b+r}$. Surprisingly, I also see that probability of getting a blue ball in the $n$-th trial is also $\frac{b}{b+r}$. I don't understand the intuition behind this.

Another question is the intuitive argument to prove that the number of red balls in the first $n$ trials follow an uniform distribution between $0$ and $n$ when $b=r=c$.

Best Answer

The situation is easy to understand in the special case where $b=r$, since the whole situation is then symmetric between the two colors. Parts of the situation are asymmetric, for example the situation after you draw a blue ball on the first trial and add $c$ more blue balls. Obviously, blue is then favored. But there is an equally likely situation that equally favors red, so the overall situation is still symmetric.

Let me now try to explain a similar intuition in the non-symmetric case where $b\neq r$. Imagine that the $b+r$ balls initially in the urn are not only colored with two colors but also numbered, with $b+r$ distinct numbers, say numbers $1$ to $b$ for the blue balls and $b+1$ to $b+r$ for the red balls. Also, imagine that whenever a ball is drawn in a trial and is put back into the urn, the additional $c$ balls have not only the same color but also the same number as the ball that was drawn. (So, although the initial situation had different numbers on all the balls, later situations will have many balls with the same number.) Notice that, throughout the process, all balls numbered $1$ to $b$ will be blue and all balls numbered $b+1$ to $b+r$ will be red.

Now temporarily forget about colors and concentrate on numbers. The initial situation is symmetric between all $b+r$ numbers. Just as in the first paragraph above, the overall situation remains symmetric throughout the process. So all $b+r$ numbers have the same chance of being drawn in the $n$-th trial.

As before, there are asymmetries in conditional probabilities, for example the second ball drawn is more likely to have the same number as the first than to have any other particular number. But, as before, the asymmetries are equally likely to favor any of the numbers. So the overall probabilities are equal.

But (taking the colors into consideration again) this means that the probability of drawing a blue ball on the $n$th trial is $b/(b+r)$ because $b$ of the $b+r$ equally likely numbers correspond to blue balls.

Related Question