[Math] Explaining Why the Zero Conditional Mean Assumption is Important

conditional-expectationeconomicsregressionstatistics

I am currently relearning econometrics in more depth than I had before. One thing I am trying to make sense of currently is why it is necessary for the assumption of:
$$E(u\mid x)=E(u) $$
to be true (where $u$ is the error term).

Here is how I have tried to reason through it, although I am not sure if this is a good reasoning on why.

Let's say $u$ is somehow correlated with some variable $y$, which $x$ is also correlated with. In this case,
$$E(u\mid x) \not= E(u) $$
since for greater $x$ values the expectation of the error would go up or down since it is correlated with $x$ through the $y$ variable. With this being the case, the line of best fit would end up with greater or lesser expected errors as $x$ increases and decreases.

Is this what the zero conditional mean assumption is trying to say, or is there a better reasoning that I'm not hitting on?

Thank you!

Best Answer

This assumption means that the error $u$ doesn't vary with $x$ in expectation. Often $\mathbb{E}u=0$, so this means that the error is always centered on your prediction.

This is weaker than independence, though, where $\mathbb{E} [f(u)|x]=\mathbb{E}[f(u)]$ for all (measurable) functions $f$.

In particular, if we take $f(u)=(u - \mathbb{E}[u|x])^2=(u-\mathbb{E}u)^2$ it is possible that $\mathbb{E}[f(u)|x] = \operatorname{Var}(u|x)$ can vary with time with this assumption. In a different word: heteroskedasticity.