I am reading Introduction to Probability by Blitzstein and Hwang – Expectation. The book states :
An urn contains $w$ white balls and $b$ black balls, which are randomly drawn one by one without replacement. The number of black balls drawn before drawing any white balls has a negative hypergeometric distribution. For example, if we shuffle a deck of cards and deal them one at a time, the number of cards dealt before uncovering the first ace is a negative hypergeometric with $w=4,b=48$.
Finding the expected value of a negative hypergeometric r.v. directly from the definition results in very complicated sums of products. But the answer is very simple-looking: $b/(w+1)$.
Let us prove this using indicator r.v.s. Label the black balls as $1,2,3,\ldots,b$ and let $I_{j}$ be the indicator of the black ball $j$ being drawn before any white balls have been drawn. Then, $P(I_{j}=1)=1/(w+1)$, since listing out the order in which black ball $j$ and the white balls are drawn (ignoring the other balls), all orders are equally likely by symmetry, and $I_{j}=1$ is equivalent to black balls $j$ being first in this list.
I know that, when sampling without replacement, the number of failures(drawing a black ball) until the first success(drawing a white ball) is a negative hypergeometric r.v.
But, why is $P(I_{j}=1)=1/(w+1)$? And why do we ignore the other black balls?
Best Answer
Just to give the question a formal answer (related to BGM's comments and Quasar's responses):