Probability Theory – Formal Definition of a Random Variable

measure-theoryprobability theoryrandom variables

The concept of random variable is not new to me and I know the measure theory. Anyway, I started reading the book "Stochastic Differential Equation" by B. Oksendal, and I'm having some problems in understanding.

Given a probability space $(\Omega, \mathcal{F}, P)$, $X : \Omega
\rightarrow \mathbb{R}^n$
is a random variable if $X$ is a
$\mathcal{F}$-measurable function, or equivalently, if

$$ X^{-1}(U) = \{\omega \in \Omega; X(w) \in U\} \in \mathcal{F} $$

for all the open set $U \subset \mathbb{R}^n$.

My main problem is that I always thought that

$$X \in \Omega$$

while the book by B. Oksendal asserts that $X : \Omega \rightarrow \mathbb{R}^n$ and hence $X(\omega) \in \mathbb{R}^n$ for $\omega \in \Omega$.

For example, consider a fair dice. Then, $\Omega = \{1, 2, 3, 4, 5, 6\}$.
One can be concerned to evaluate the probability that the dice result is even. In this case, we write:

$$P(X \in \{2, 4, 6\}) = …$$

In general, the probability of an event $U \subset \Omega$ ($U \in \mathcal{F}$) is normally written as follows:

$$P(X \in U)$$

This notation suggest me that $X \in \Omega$. Again, I don't understand what means that $X : \Omega \rightarrow \mathbb{R}^n$.

Best Answer

The way I like to think of it is that it is a function that, in a sense, relieves the problem of dealing with nonnumerical elements by assigning each of them a real number (or real-valued vector) so that they can be compared on the real number line.

For instance, let's say I want to figure out how likely it is for a randomly considered member of the population to be no taller than my bio teacher Jim is. Even though it is not impossible to assign a set whose members fit this criteria working entirely within the sample space of human beings, and then to assign a certain measure of probability $P$ to that set, assigning to each person their height is a function that makes this task a bit easier.

Numerically speaking, let $X$ be the function from the sample space of human beings to the (nonnegative) real line that assigns to each person $\omega$ a height $X(\omega)$. Let's say Jim's height is 6 (feet). When we say, then, in layman's terms, what percentile Jim's height represents, what we mean to do is to ask for $P\{\omega : X(\omega) \leq X(Jim)\}$, which is the measure of the set of people (i.e. the probability measuring of the set of people) whose heights do not exceed Jim's. Notice that the function sends us to a nice, ordered place where just considering the people themselves without this numerical value would not be sufficient for this task.

Related Question