Probability – Defining Expectation of Measurable Function with Non-Probability Measure

expected valueintegrationlebesgue-integralmeasure-theoryprobability

The typical definition of expectation requires a probability space and a random variable

Let $(\Omega, \mathcal{F}, \mathbb{P})$ be a probability space, $(\mathsf{X}, \mathcal{X})$ be a measurable space and $\mathrm{X}:\Omega\to\mathsf{X}$ be a random variable. The expectation of $\mathrm{X}$ is defined as
$$
\mathbb{E}[X] := \int_\Omega X \,\,d\mathbb{P}.
$$

Can we relax this? Meaning, can I sensibly define the expectation of a measurable function, given a measure space $(\Omega, \mathcal{F}, \mathbb{M})$, where $\mathbb{M}$ is now simply a measure?


Notes:

  1. I am using the following classical distintion between a random variable and a measurable function: every random variable is a measurable function, but not any measurable function is a random variable. For a measurable function to be a random variable, we need the existance of a probability measure on $(\Omega, \mathcal{F})$.
  2. In theory, I know this is generalisable because we typically work with Lebesgue integrals and the Lebesgue measure is not a probability measure, in general. However, what condition on $\mathbb{M}$ do I need? Intuitively, my guess is that it needs to be a $\sigma$-finite measure.

Summary

Thank peek-a-boo for the helpful answer and discussion, here's a summary.

Let $(\Omega, \mathsf{F}, \mathbb{M})$ be any measure space, and let $(\mathsf{X}, \|\cdot\|_\mathsf{X})$ be any Banach space, where $\|\cdot\|_{\mathsf{X}}$ is the norm on $\mathsf{X}$. Let $\mathcal{B}(\mathsf{X})$ be the Borel sigma algebra on $\mathsf{X}$ that has been generated by the topology $\tau_\mathsf{X}$ which has been induced by the metric $d(x, y) := \|x – y\|_{\mathsf{X}}$ which has itself been induced by the norm $\|\cdot\|_{\mathsf{X}}$. Then we can define the $L^p$ spaces
$$
L^p(\Omega, \mathcal{F}, \mathbb{M}) := \left\{f:\Omega\to\mathsf{X}\left| f \text{ is } \mathcal{F}\text{-measurable and } \left(\int \|f(\omega)\|_{\mathsf{X}}^p \, \mathbb{M}(d\omega)\right)^{1/p} < \infty \right.\right\}, \qquad p\geq 1.
$$

These exist both for finite and infinite-dimensional Banach spaces $\mathsf{X}$. The larger $p$, the more structure we are giving. Meaning that under mild regularity conditions, the spaces are nested $L^q \subset L^p$ for $q > p$. In particular, for $p = 1$ all functions in $L^1$ are Bochner integrable.

An integral is called expectation when $\mathbb{M}$ is a probability measure (or finite, I suppose, since by rescaling they are equivalent). Then expectations exist when the measurable function $f$ is in $L^1$.

Best Answer

For any measure space $(X,\mathfrak{M},\mu)$, and any finite-dimensional normed vector space $V$, one has a very standard theory of integration for maps $f:X\to V$ which are $\mathfrak{M}$-$\mathcal{B}(V)$ measurable ($\mathcal{B}(V)$ being the Borel $\sigma$-algebra on $V$). Literally open up any measure theory book (Folland/Rudin/Stein-Shakarchi etc) and read one of the first few chapters (ok Stein does abstract measure theory in chapter 6 or somehting, but Folland and Rudin do it in Chapters 1/2). One can then talk all about the $L^p(\mu;V)$ spaces. Folland and Rudin may seem to describe things only for $V=\Bbb{C}$, but the case of finite-dimensional $V$ follows immediately.

For a general Banach space $V$ as the target, one has the notion of Bochner integral; see Lang or Amann-Escher for the details (Folland outlines the basics in an exercise, and Rudin talks about integration $V$-valued integrals when $V$ is some type of topological vector space in his functional analysis text… though briefly).

Note that for 99% of the basic theory, there is no reason at all to make any restrictions on the measure space on your domain. We don’t need finiteness, or $\sigma$-finiteness or semifiniteness or completeness or anything. For more complicated ‘structure theorems’ like the Radon-Nikodym theorem or the Lebesgue decomposition, or Fubini-Tonelli’s theorem you’re going to want $\sigma$-finite. But the finiteness assumption can easily be removed for MANY things.


So, for any $f\in L^1(\mu; V)$, we can talk about its integral $\int_Xf\,d\mu$; this is a vector in $V$. However, if $\mu$ is not a probability measure, one should NOT refer to this as an expectation value. Why? It’s just bad terminology. Also from an intuitive/linguistic perspective, ‘expectation’ is supposed to mean ‘the average value of something’, and average values are supposed to be the sum divided by the number of things (i.e this division is meant to provide a probability measure). Well, if you have an infinite measure space, then you’ll be dividing by $\infty$, which is just bad, or more concretely, there’s no natural way to convert an infinite measure $\mu$ into a probability measure. However, (as mentioned in the comments) if $0<\mu(X)<\infty$, then you can easily rescale to get a probability measure.