Solved – How to combine likelihoods from multiple binomial trials. Can we also determine the overall FDR

binomial distributionfalse-discovery-ratemultiple-comparisonssimulation

A king has rounded up 1000 people suspected of counterfeiting coins, which look and feel the same as the official coin. However, only the official coin is truly fair (Pr(heads)=0.5), while all fake coins produce strongly skewed coin flip results (biased towards heads or tails).

The king decides to identify the counterfeiters by flipping a coin taken from each of the suspects. He leaves the actual flipping to 6 of his guards, handing them a suspected fake coin AND one official coin. The latter is to assure the king that his guards carried out the coin flips faithfully. Unfortunately, the king forgets to tell the guards how often to flip each coin, only that both coins (fake and official) must be flipped an equal number of times. All 1000 suspected fake coins tested in this manner go through the hands of all 6 guards.

Because heads of proven counterfeiters will roll, the king cannot afford to be wrong and commit too many innocent people to die. But he's comfortable executing up to 50 innocent people.

How should the king analyze the coin flip data from his 6 guards to identify the counterfeiters among the 1000 suspects? Can he also find a way to keep the number of innocently convicted under 50?

To get started, here's a sample from the first suspect (heads/tails):

Guard 1: suspect coin (88/11), official coin (42/47)
Guard 2: suspect coin (38/5), official coin (22/21)
Guard 3: suspect coin (115/15), official coin (70/60)
Guard 4: suspect coin (39/33), official coin (35/37)
Guard 5: suspect coin (70/13), official coin (43/40)
Guard 6: suspect coin (22/18), official coin (19/21)

Thank you!

Best Answer

If the guards are independent of each other and the tosses are fair then it doesn't matter which guard tossed which coin or how many times each guard tossed each coin. The results for each coin can be grouped together. Thus for the coin that you give data the grand result is 372 heads from 467 tosses (a fairly convincingly biassed coin).

Rank the coins in order of the ratio of the likelihood of the maximally likely Pr(heads) divided by the likelihood of Pr(heads)=0.5 and the owners of the coins with the 50 highest ratios are your 50 best choices of culprits.

The likelihood function you need is:

$$ L(\theta) \propto \binom{n}{h}p^h(1-p)^{n-h} $$

where $\theta$ is the set of all possible values of $p$, Pr(heads), $n$ is the total number of tosses and $h$ is the number of heads observed. Plug in $p=\frac{372}{467}$ to get the likelihood of the most likely value of $p$ for the coin in your question and $p=0.5$ and divide the two values to get the likelihood ratio that represents the maximal strength of the evidence for that coin being biassed.

There is no need to do a significance test for this problem and so you do not need to combine P-values.

You can set criteria for how strong the evidence needs to be before you sentence a coin owner to death, or you can just kill the 50 against whom the evidence is strongest.

Related Question