[Math] How does Hessian matrix describe the local curvature

calculus

For a 2-variable function $f(x,y)$, the Hessian matrix is
$$\mathcal{H}(f) = \left[\begin{array}{cc} \frac{\partial^2 f}{\partial x^2} & \frac{\partial^2 f}{\partial {xy}} \\
\frac{\partial^2 f}{\partial {xy}} & \frac{\partial^2 f}{\partial y^2}\end{array}\right]$$,
how does it describe the local curvature of $f$?

Best Answer

Are you familiar with the Taylor expansion of a single variable function?$$f(x)=f(x_0)+f'(x_0)\Delta x+\frac12f''(x_0)\Delta x^2+\dots$$We can also find Taylor expansions for multivariable functions; the first two terms should be familiar:$$f(\mathbf{x})=f(\mathbf{x}_0)+J(\mathbf{x}_0)\Delta\mathbf{x}+\frac12\Delta\mathbf{x}^TH(\mathbf{x}_0)\Delta\mathbf{x}+\dots$$... so we see that while the Jacobian corresponds to our original idea of the derivative, the Hessian instead serves a purpose like our second derivative -- it indeed captures local curvature information. In fact, if you look at its elements, this should become very clear.