Boltzmann Entropy – Compact Sublevel Sets in Wasserstein Distance

measure-theoryoptimal-transportationpr.probabilityreference-requestwasserstein-distance

For Lebesgue-absolutely continuous probability measures $\rho\ll \mathcal{L}^d$ in the whole space $\mathbb{R}^d$ with finite second moments (i-e $\rho\in \mathcal{P}^2_{ac}(\mathbb{R}^d)$), let
$$
\mathcal{H}(\rho)=\int_{\mathbb{R}^d}\rho\log \rho \,dx
$$

be the Boltzmann entropy. It is well-known that $\mathcal{H}$ is lower semi-continuous (l.s.c.) in the Wasserstein metric $W_2$.

Question: Fix $C>0$. Is the sublevel set $\{\rho \in \mathcal{P}^2_{ac}(\mathbb{R}^d) : \mathcal{H}(\rho) \le C\}$ compact in $W_2$?

Thank you so much for your elaboration! Any reference is greatly approciated!

Best Answer

$\newcommand{\R}{\mathbb R}\newcommand{\H}{\mathcal H}$Building up on previous comments, and slighlty elaborating. The answer to your question, as is, is NO. Recall that convergence in the Wasserstein distance $W_2$ is equivalent to weak-* convergence (duality with continuous bounded functions) and convergence of the second moments (or equivalently, with all continuous functions growing at most quadratically at infinity). As a consequence, in order for your compactness to hold you would need to control the second moments in sublevelsets. However, the entropy is translation-invariant, so mass can escape at infinity in a sublevelset $\{\H\leq C\}$. More precisely, for a counterexample take as in @Kostya_I 's comment any measure $\rho$ with finite entropy $\H(\rho)=C<\infty$, take any $x_n\in\R^d$ such that $|x_n|\to\infty$, and consider the translated sequence $\rho_n=\rho(\cdot-x_n)$. Then obviously $\H(\rho_n)=\H(\rho)=C$ for any $n$, but $\rho_n$ fails to converge in the Wasserstein distance (since for example $\rho_n$ converges weakly to zero when tested with respect to any compactly supported function).

Since the problem here is only due mass escaping at infinity, for your desired compactness to hold you only need a little bit of confinement. For example it would suffice to control the second moment $\mathfrak m_2(\rho)=\int|x|^2\rho(dx)$, for example by adding a potential energy $$ \mathcal F(\rho)=\H(\rho)+\int V(x)\rho(dx) $$ for some potential $V(x)\gtrsim C|x|^2$ growing at least quadratically at infinity. For this you can use the handy Carleman estimate $$ \H(\rho)\geq -C_\alpha(1+\mathfrak{m}_2(\rho))^\alpha, \qquad \alpha\in \big(d/(d+2),1\big). $$ Check the original JKO paper [1], proof of proposition 4.1 (eqs. 14 and 15 therein). Actually you can replace the potential energy $\int V\rho$ by anything controlling $\mathfrak m_2(\rho)^{\frac{d}{d+2}}$ in order to retrive control over the moments from energy bounds $\mathcal F(\rho)\leq C\Rightarrow \mathfrak m_2(\rho)\leq M(C)$.

Connecting with @pseudocydonia 's answer above: another way to interpret the "potential energy above" would be to consider instead the relative entropy $\H_\mu(\rho)=\int \frac{\rho}{\mu}\log\left(\frac{\rho}{\mu}\right)d\mu$ for a log-quadratic reference measure $\mu\sim e^{-V}$ with $V\gtrsim C|x|^2$.


Side note: in your setting you also have a slightly worse problem than what you may suspect. When computed relatively to the Lebesgue measure on (sufficiently wide) unbounded sets the entropy is actually not bounded from below on $\mathcal P_{ac}^2$.


[1] Jordan, R., Kinderlehrer, D., & Otto, F. (1998). The variational formulation of the Fokker--Planck equation. SIAM journal on mathematical analysis, 29(1), 1-17.