[Math] Is Jensen’s inequality an iff condition on convex functions

convex-analysis

According to wikipedia this is Jensen's inequality:

If X is a random variable and φ is a convex function, then:

$$\varphi\left(\mathbb{E}[X]\right) \leq \mathbb{E}\left[\varphi(X)\right].$$

Which stated as an implication reads as follows:

If Convex $\implies $ $\varphi\left(\mathbb{E}[X]\right) \leq \mathbb{E}\left[\varphi(X)\right].$

However, I was wondering if the converse was also true.

$\varphi\left(\mathbb{E}[X]\right) \leq \mathbb{E}\left[\varphi(X)\right]$ $\implies $ Convex.

I thought it would be an iff because of the definition of convexity. Recall what convexity means (and let me explain why I thought iff for jensen's):

f is called convex if:
$$\forall x_1, x_2 \in X, \forall t \in [0, 1]: \qquad f(tx_1+(1-t)x_2)\leq t f(x_1)+(1-t)f(x_2).$$

Since its a definition, the the word convex and its definition statement are logically equivalent. Therefore, its an iff, i.e.

f is called convex $ \Leftrightarrow \forall x_1, x_2 \in X, \forall t \in [0, 1]: \qquad f(tx_1+(1-t)x_2)\leq t f(x_1)+(1-t)f(x_2).$

right? Does that logic generalize to Jensen's inequality or am I wrong for both?

Context:

The reason that I care about this is because I wanted to proof that a multivariable function f is convex. My plan of attack was to show Jensen's inequality "worked" on it and since Jensen's inequality and convexity are logically equivalent, then I can conclude that f is convex. But that only works if its an iif. Also, I absolutely wanted to avoid taking the Hessian or derivatives to prove it was convex, I strictly wanted to stick with the "original" definition of convexity to prove this (i.e. the inequality definition).

Best Answer

The convexity of $f$ is equivalent to Jensen's inequality holding for all random variables $X$. (If it holds for all $X$, then in particular it holds for the random variable with $P(X=x_1)=t$ and $P(X=x_2)=1-t$, where Jensen reduces to the definition of convexity.) So if somehow it's easier to prove Jensen for all $X$ (perhaps because the generality of that statement clears away distractions), then sure, your method is sound.

Related Question