Probability – Finding the Fake Coin

Tags: expected value, probability, variance

Problem from an old exam:

We have $m$ coins, one of which is fake. We know that a real coin shows heads with probability $\frac{1}{2}$, while the fake coin shows heads with probability $p > \frac{1}{2}$.
We search for the fake coin in the following way: we toss each coin $n$ times and count the number of heads. The procedure indicates the coin with the most heads (in case of a tie, the procedure does not indicate any coin).

Prove that for every $p > \frac{1}{2}$, $m \geq 2$, and $\epsilon > 0$, there exists an $n$ such that the above procedure identifies the fake coin with a probability of at least $1-\epsilon$. Find $n$.
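
To make the setup concrete, here is a minimal simulation sketch of the procedure (my own illustration, not part of the exam problem; the names `run_procedure` and `trials` are mine). It treats coin 0 as the fake one and counts ties as failures:

```python
import numpy as np

rng = np.random.default_rng(0)

def run_procedure(m, n, p, trials=10_000):
    """Estimate the probability that the max-heads rule finds the fake coin.

    Coin 0 is the fake one (heads probability p); coins 1..m-1 are fair.
    A tie for the maximum indicates no coin, so it counts as a failure.
    """
    successes = 0
    for _ in range(trials):
        heads = rng.binomial(n, [p] + [0.5] * (m - 1))  # head counts per coin
        best = heads.max()
        # success only if the fake coin is the unique maximum
        if heads[0] == best and (heads == best).sum() == 1:
            successes += 1
    return successes / trials

print(run_procedure(m=5, n=200, p=0.6))  # tends to 1 as n grows
```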

$$Y_i = \begin{cases}
1, & \text{if the } i\text{-th toss of the fake coin results in heads} \\
0, & \text{otherwise}
\end{cases}$$

Let $Y = \sum_{i=1}^n Y_i$ denote the number of heads in $n$ tosses of the fake coin.

We have $$\mathbb{E}[Y] = n \cdot p$$

$$\text{Var}[Y] = np(1 + (n-1)p) - (np)^2 = np(1-p)$$

Let $X_k$ denote the number of heads in $n$ tosses of the $k$-th fair coin, for $k = 1, \ldots, m-1$.
$$
\mathbb{E}[Y^2] = \mathbb{E}\left[\sum_{i=1}^{n} Y_i^2 + \sum\limits_{i \neq j} Y_i Y_j\right] = \mathbb{E}[Y] + \sum_{i \neq j} \mathbb{E}[Y_i Y_j] = np + n(n-1)p^2
$$
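
Since $Y$ is just a $\mathrm{Binomial}(n, p)$ variable, these moments can be sanity-checked numerically; here is a quick check with `scipy.stats` (a convenience I'm adding, not part of the derivation):

```python
from scipy.stats import binom

n, p = 50, 0.7                                   # arbitrary test values
Y = binom(n, p)                                  # Y ~ Binomial(n, p)
assert abs(Y.mean() - n * p) < 1e-9              # E[Y] = np
assert abs(Y.var() - n * p * (1 - p)) < 1e-9     # Var[Y] = np(1 - p)
print(Y.mean(), Y.var())                         # 35.0 and 10.5, up to float noise
```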

For the test to correctly identify the fake coin, the event $\max(X_1, \ldots, X_{m-1}) < Y$ must occur, i.e., $(X_1 < Y) \land (X_2 < Y) \land \ldots \land (X_{m-1} < Y)$.

I want to calculate $P(X < Y) = P(X - Y < 0)$, where $X$ is the head count of a single fair coin.
Let $D = X - Y$. We have
$$\mathbb{E}[D] = \mathbb{E}[X] - \mathbb{E}[Y]$$
$$\text{Var}[D] = \text{Var}[Y] + \text{Var}[X]$$

How can I find $P(D < 0)$ with this information? Is this approach suitable for this question?

Thanks.

Best Answer

You can compute this with a relatively simple union bound. Using the notation in the post, let $X = \max(X_1, \dots, X_{m-1})$; it suffices to show that $P(X \geq Y) \to 0$ as $n \to \infty$ for fixed $p > 1/2$ and $m \geq 2$. We have
$$ P(X \geq Y) = P(X_1 \geq Y \text{ or } X_2 \geq Y \text{ or } \dots \text{ or } X_{m-1} \geq Y ) \leq \sum_{i=1}^{m-1} P(X_i \geq Y) = \sum_{i=1}^{m-1} P(X_i - Y \geq 0). $$

Put $Z_i = Y - X_i$; then
$$E[Z_i] = n(p - 1/2) = n\mu > 0$$
and
$$\operatorname{Var}(Z_i) = \operatorname{Var}(X_i) + \operatorname{Var}(Y) = \frac{n}{4} + np(1-p) = n\sigma^2 > 0$$
for the positive constants $\mu = p - 1/2$ and $\sigma^2 = 1/4 + p(1-p)$. A Chebyshev bound gives immediately that
$$ P(Z_i \leq 0) \leq P(|Z_i - n\mu| \geq n\mu) \leq \frac{n\sigma^2}{(n\mu)^2} = \frac{\sigma^2}{n\mu^2}, $$
so
$$ P(X \geq Y) \leq \sum_{i=1}^{m-1}P(X_i - Y \geq 0) = \sum_{i=1}^{m-1}P(Z_i \leq 0) \leq \frac{(m-1)\sigma^2}{n \mu^2}. $$
Clearly this goes to $0$ as $n \to \infty$, as desired. In particular, any
$$ n \geq \frac{(m-1)\sigma^2}{\epsilon \mu^2} = \frac{(m-1)\left(\tfrac14 + p(1-p)\right)}{\epsilon\,(p - \tfrac12)^2} $$
makes the failure probability at most $\epsilon$, which gives an explicit choice of $n$.

In general I suspect that the moments of $X$ might be annoying to calculate, especially in an exam, so one would like to pass to individual $X_i$.
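
As a sanity check on the bound (my own addition, not part of the answer above), one can invert $\frac{(m-1)\sigma^2}{n\mu^2} \leq \epsilon$ to get a concrete $n$ for a few parameter choices; the helper name `chebyshev_n` is hypothetical:

```python
import math

def chebyshev_n(m, p, eps):
    """Smallest n guaranteed by the union + Chebyshev bound above."""
    mu = p - 0.5                    # E[Z_i] / n
    sigma2 = 0.25 + p * (1 - p)     # Var(Z_i) / n
    return math.ceil((m - 1) * sigma2 / (eps * mu ** 2))

for m, p, eps in [(2, 0.6, 0.05), (10, 0.6, 0.05), (10, 0.9, 0.05)]:
    print(m, p, eps, chebyshev_n(m, p, eps))
# The required n grows linearly in m and 1/eps, and blows up as p -> 1/2.
```

The resulting $n$ is quite conservative: Chebyshev is a weak tail bound, and a Chernoff-type argument would give an $n$ growing only logarithmically in $m$.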
