Intuition on independence of two events.

probability

The following question is taken from a textbook: Stanley H. Chan: Introduction to Probability for Data Science, 2021. (pp. 87-88)

Please note that this is not a question on how to calculate independence, I am fully aware of this. Instead, the intuition is what I am looking for in the below confusing question.

Consider the experiment of throwing a die twice. One should be clear from the context that the outcomes are in the form of a tuple $(\textbf{dice_1}, \textbf{dice_2})$ and the sample space is:

$$
S = \left\{(1, 1), (1, 2), \ldots, (6, 6)\right\}
$$

Define the three events below:

$$
A = \{\textbf{1st dice is 3}\} \quad B = \{\textbf{sum of two die is 7}\} \quad C = \{\textbf{sum of two die is 8}\}
$$

We want to find out if events $A$ and $B$ are independent? How about $A$ and $C$?

Now the answer is easy to get if you just use the formula and show that if $P(A \cap B) = P(A)P(B)$ and in a similar vein for event $A$ and $C$. However, I want to understand it more intuitively as my intuition failed me immediately, when I saw the question I thought that both should have similar answer since they are asked similarly.


We focus on the independence of $A$ and $C$ first. The author said that intuitively, given that event $C$ has happened, will this affect the probability of $A$ happening? I assume that this means we do have to know the probability of event $A$ without $C$ first.

We can enumerate and see that event $A$ has the following set representation:

$$
A = \{(3, 1), (3, 2), (3, 3), (3, 4), (3, 5), (3, 6)\}
$$

which amounts to $P(A) = \frac{6}{36} = \frac{1}{6}$. Now if $C$ happened, we know that the two rolls have a sum of $8$, and we cannot construct a sum of $8$ with a roll of $1$. To me, I immediately know that event $A$ cannot have the outcome that has a $1$ in the second roll, and thus the outcomes should only be limited to $5$ instead of $6$ and hence dependence is established.

However, I believe somewhere my intuition is flawed, the author mentioned that:

If you like a more intuitive argument, you can imagine that C has happened, i.e., the sum is 8. Then the probability for the first die to be 1 is 0 because there is no way to construct 8 when the first die is 1. As a result, we have eliminated one choice for the first die, leaving only five options. Therefore, since C has influenced the probability of A, they are dependent.

I think I cannot understand why the author mentioned about "first die" when in event $A$, the first die is already a $3$. If we follow this line of logic, does this mean we do not actually need to know how event $A$ is defined? Is my interpretation wrong?

Best Answer

Define the three events below: $$A = \{\textbf{1st dice is 3}\} \quad B = \{\textbf{sum of two die is 7}\} \quad C = \{\textbf{sum of two die is 8}\}$$

Neither your explanation nor the author's is correct.

  • Let $S$ be the event that the first roll is smaller than $10.$

    Then, by the author's reasoning:

     event $C$ occurring affects the options for the first roll and consequently event $S,$ so events $C$ and $S$ are dependent.

    This is of course false, since $S$ is a certain event and is thus independent of every other event associated with the experiment.

  • By your reasoning:

     event $B$ occurring immediately eliminates the outcome $(3,6),$ so limits how event $A$ can eventuate, so events $B$ and $A$ are dependent.

    This is false, since events $P(B\cap A)=P(B)P(A).$

When considering whether events $X$ and $Y$ are independent, the correct intuition is to ask whether knowing that $X$ happens changes the probability of $Y$—not whether it restricts how $Y$ can occur.

I wrote more here.

Related Question