[Math] An Intuition to An Inclusion: “Union of Intersections” vs “Intersection of Unions”

elementary-set-theoryintuitionsequences-and-series

Let $E = \{E_k\}_{k \in \mathbb{N}}$ be an infinite sequence of sets. Then, the following inclusion holds:

$\bigcup_{n=1}^{\infty} \bigcap_{k=n}^{\infty} E_k \quad\subseteq\quad \bigcap_{n=1}^{\infty} \bigcup_{k=n}^{\infty} E_k$

I know the left-hand side (LHS) represents elements which belong to all but finitely many sets in the sequence $E$, and the right-hand side (RHS) represents elements which belong to infinitely many sets in the sequence $E$. This concludes the proof that the LHS is contained in the RHS. Yet it is not intuitive for me, since the above interpretation is not intuitive per se.

Is there an intuitive reason why the above inclusion holds? That is, (very informally) why union of intersections is contained in intersection of unions?

Best Answer

If $A$ and $B$ are sets, and $A\subseteq B_n$ for all $n$, then $A\subseteq\bigcap_n B_n$. Conversely, $A\subseteq\bigcap_n B_n$ means that $A\subseteq B_n$ for all $n$.

This tells us that the inclusion you are interested in reduces then to checking $(+)$: $$ \bigcup_n \bigcap_{k\ge n}E_k\subseteq \bigcup_{k\ge m}E_k $$ for all $m$.

(This is progress. It is similar to a common move in analysis: To prove that $a\le b$, it is not unusual to check instead that $a\le c$ for any $c>b$. But these inequalities $a\le c$ tend to be easier than the one you really want, $a\le b$.)

Now, the same idea gives us that to prove the new inclusion $(+)$, it is enough to prove $(++)$: $$\bigcap_{k\ge n}E_k \subseteq \bigcup_{k\ge m}E_k $$ for all $n,m$.

(The corresponding move in analysis is that to prove $a\le b$, it is enough to show $c\le b$ for all $c\le a$. Unions correspond to suprema, intersections to infima in these analogies.)

Now, $(++)$ is really obvious: If somebody is in the left hand side, then it is in all sufficiently large $E_k$, but then it is in the right hand side.

The point is: It is useful to train oneself to "decode" certain expressions the way we did, by focusing on their "atomic" or at least "more basic" components.

Related Question