How do the binomial distribution works

binomial distributionprobabilityprobability theory

I have the following question, we have just been looking at random variables and their distributions. So I know that a random variable (RV) is a function $X:\Omega\rightarrow M$ where $M$ is an arbitrary quantity. Now if we define a measure of probability $P$ on $\Omega$ then we can use RV to project this measure onto the set $M$ and get the image measure $P_X(A)=P(X^{-1}(A)) $where $A$ is a subset of $\Omega$, then $P_X$ is called the distribution of our RV.
Is this correct up to here?

Now we have looked at the binomial distribution, for example. There we take $p\in [0,1]$ and $X:\Omega \rightarrow \{0,…,n\}$ then we said that $P(X=k)= \binom{n}{k}p^k (1-p)^{n-k}$. But now it says in a lecture that the binomial distribution itself is $$\sum _{k=0}^n \binom{n}{k}p^k (1-p)^{n-k} \delta_k$$
Somehow I'm a bit confused now, firstly I don't see why we defined it differently and secondly I don't see why we have to take such a sum here. Could someone explain this to me? I would be very grateful.

Best Answer

Your first paragraph about random variables, distributions and image measures is correct.

It is important to distinguish between a binomial random variable, which is a random variable $X$, whose distribution $\mathbb{P}_X$ is the binomial distribution, and the binomial distribution itself, which is just a measure.

Suppose, that $X:\Omega \rightarrow \mathbb{R}$ is a random variable with distribution $$\mathbb{P}_X = \sum_{k=0}^n {n \choose k } p^k(1-p)^{n-k}\delta_k,$$ what can we then say about $\mathbb{P}(X=j)$ for $j\in \{0,\dots,n\}$? We compute using the formula \begin{align*} \mathbb{P}(X=j) &= \mathbb{P}_X(\{j\}) \\ &= \sum_{i=0}^n {n \choose k } p^k(1-p)^{n-k}\delta_k(\{j\}) \\ &= {n \choose j } p^j(1-p)^{n-j} \end{align*} where we have used that $$\delta_k(\{j\}) = \begin{cases} 1 & k=j \\ 0 & k\neq j\end{cases}.$$ And we see that two definitions do agree with eachother. The advantage of the second definition is, that $\sum_{k=0}^n {n \choose k } p^k(1-p)^{n-k}\delta_k(A)$ is well defined for all measurable sets $A \subseteq \mathbb{R}$ and not just singletons. Do note that some random variables (normally distributed r.v. for instance) do have $\mathbb{P}(X=j) = 0$ for all $j$.

Related Question