Conditional Probability – How to Develop Intuition for Conditional Probability

conditional probabilityintuitionprobability

In the video lectures from Harvard's Statistics 110:Probability course that can be found on iTunes and YouTube, I encountered this problem. I tried to summarize it here:

Suppose we are given a random two-card hand from a standard deck.

  1. What is the probability that both cards are aces given that we have at least one ace?

$$
P(both\ aces | have\ ace) = \frac{P(both\ aces, have\ ace)}{P(have\ ace)}
$$

Since having at least one ace is implied if you have both aces, the intersection can be reduced to just $P(both\ aces)$

$$
P(both\ aces | have\ ace) = \frac{P(both\ aces)}{P(have\ ace)}
$$

This is then just

$$
P(both\ aces | have\ ace) = \frac{4C2\ /\ 52C2}{1-48C2\ /\ 52C2}=\frac{1}{33}
$$

  1. What is the probability that both cards are aces given that we have the ace of spades?

$$
P(both\ aces | have\ ace\ of\ spades)=\frac{P(both\ aces, have\ ace\ of\ spades)}{P(have\ ace\ of\ spades)}
$$

$$
P(both\ aces | have\ ace\ of\ spades)=\frac{(3C1*1C1)\ /\ 52C2}{2!*\frac{51}{52}*\frac{1}{51}}=\frac{1}{17}
$$

Now, somewhere along these examples I got lost…

The latter is obviously just the same as $\frac{3}{51}$, which makes a lot of sense (to me) that this would be the answer. If you are told that you have the ace of (say) spades, then you know that there are $3$ more aces and $51$ more cards.

But in the former example, the math seems fine (and I believe the lecturer wouldn't give this example if it was incorrect…), but I can't wrap my head around this.

How do I get some intuition for this problem?

Best Answer

To aid the intuition, consider visualizing two events (sets of outcomes):

  1. The conditioning event, which is the information given.

  2. The conditioned event, whose probability you would like to find.

The conditional probability is found by dividing the chance of the second by the chance of the first.


There are $52 \times 51$ equally likely ways to deal two cards at random. A convenient way to visualize these deals is to lay them out in a table with rows (say) designating the first card dealt and columns the second card in the deal. Here is a part of this table, with ellipses ($\cdots$) designating the missing parts. Notice that because the two cards cannot be the same, no entries exist along the main diagonal of the table. The rows and columns are ordered from aces up through kings:

Figure 1

The questions focus on aces. The information "we have at least one ace" locates the pair within either the first four rows or the first four columns. In our mind we can visualize that schematically by coloring these rows and columns. I have colored them red, but where both aces appear I have colored them black:

Figure 2

There are $2\times 6 = 12$ pairs of all aces and $2\times (4\times 48) = 384$ other pairs with at least one ace, for a total of $12+384=396$ pairs on which you are conditioning, as represented by both the red and black areas. Because all such pairs are equally likely, the chance of the former is

$$\frac{12}{396} = \frac{1}{33}.$$

It is the black fraction of the red+black region.

The second question asserts "we have the ace of spades." This corresponds only to the very first row and column:

Figure 3

Now there are just $2\times 3 = 6$ such pairs with two aces and $2\times 48=96$ other pairs with the ace of spades, for a total of $96+6 = 102$ such pairs. Reasoning exactly as before, the chance of two aces is

$$\frac{6}{102} = \frac{1}{17}.$$

Again it is the black fraction of the red+black region.

For reference, the last figure includes the previous one shown in pink and gray. Comparing these regions reveals what happened: in going from the first question to the second, the number of pairs in the conditioning event (pink) dropped to about one-quarter of its original count (red), while the number of pairs in question dropped by only one-half (from gray to black, $12$ to $6$).


I have found such schematic figures to be helpful even--perhaps especially--when trying to understand more complicated concepts of probability, such as filtrations of sigma algebras.

Related Question