It turns out that there are two different but related notions of differentiation for a function $f:\mathbb R^n\to\mathbb R$: the total derivative $df$ and the gradient $\nabla f$.
- The total derivative is a covector ("dual vector", "linear form") and does not depend on the choice of a metric ("measure of length").
- The gradient is an ordinary vector derived from the total derivative, but it depends on a metric. That is why it looks a bit funny in different coordinate systems.
The definition of the total derivative answers the following question: given a vector $\vec v$, what is the slope of the function $f$ in the direction of $\vec v$? The answer is, of course
$$ df_{x}(\vec v) = \lim_{t\to0} \frac{f(x+t\vec v)-f(x)}{t}$$
I.e. you start at the point $x$ and walk a teensy bit in the direction of $\vec v$ and take note of the ratio $\Delta f/\Delta t$.
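This difference quotient is easy to check numerically. The sketch below uses a hypothetical function $f(x,y,z)=x^2+3yz$ and an arbitrary point and direction; it is only an illustration of the definition, not anything from the original question.

```python
# Numerical check of the directional-derivative definition df_x(v).
# The function f, the point p, and the direction v are hypothetical choices.

def f(x, y, z):
    return x**2 + 3*y*z

def directional_derivative(f, p, v, t=1e-6):
    """Approximate df_p(v) via the difference quotient (f(p + t*v) - f(p)) / t."""
    shifted = [pi + t*vi for pi, vi in zip(p, v)]
    return (f(*shifted) - f(*p)) / t

p = (1.0, 2.0, 1.0)
v = (1.0, 0.0, 2.0)

# Analytic value: df = 2x dx + 3z dy + 3y dz, so
# df_p(v) = 2*1*1 + 3*1*0 + 3*2*2 = 14
print(directional_derivative(f, p, v))  # ≈ 14
```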
Note that the total derivative is a linear map $\mathbb R^n \to \mathbb R$, not a vector in $\mathbb R^n$. Given a vector, it tells you some number. In coordinates, this is usually written as
$$ df = \frac{\partial f}{\partial x}dx + \frac{\partial f}{\partial y}dy + \frac{\partial f}{\partial z}dz $$
where $dx,dy,dz$ are the total derivatives of the coordinate functions, for instance $dx(v_x,v_y,v_z) := v_x$. This formula looks the same in any coordinate system.
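As a concrete (hypothetical) example, take $f(x,y,z) = x^2 + 3yz$. Then
$$ df = 2x\,dx + 3z\,dy + 3y\,dz, $$
and since $dx(\vec v)=v_x$, $dy(\vec v)=v_y$, $dz(\vec v)=v_z$, feeding a vector into this linear map gives
$$ df_{(x,y,z)}(\vec v) = 2x\,v_x + 3z\,v_y + 3y\,v_z. $$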
In contrast, the gradient answers the following question: what is the direction of the steepest ascent of the function? Which vector $\vec v$ of unit length maximizes $df(\vec v)$? As you can see, this definition crucially depends on the fact that you can measure the length of a vector. The gradient is then defined as
$$ \nabla f = df(\vec v_{max})\cdot\vec v_{max} $$
i.e. it gives both the direction and the magnitude of the steepest change.
This can also be expressed as
$$ \langle \nabla f, \vec v \rangle = df(\vec v) \quad\forall \vec v\in\mathbb R^n.$$
In other words, the scalar product $\langle\cdot,\cdot\rangle$ is used to convert the covector $df$ into the vector $\nabla f$. This also means that the formula for the gradient looks very different in coordinate systems other than cartesian. If the scalar product is changed (say, to $\langle\vec a,\vec b\rangle := a_xb_x + a_yb_y + 4a_zb_z$), then the direction of steepest ascent also changes. (Exercise: Why?)
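The metric-dependence can be made concrete. If $\langle\vec a,\vec b\rangle = \vec a^{\,T} M \vec b$ for a symmetric positive-definite matrix $M$, then the defining relation $\langle\nabla f,\vec v\rangle = df(\vec v)$ becomes $M\nabla f = g$, where $g$ is the column of partial derivatives. A minimal sketch, with an illustrative (hypothetical) choice of $g$:

```python
# Converting the covector df into the gradient with respect to a metric M.
# The relation <grad f, v> = df(v) with <a,b> = a^T M b gives M * grad = g,
# where g holds the partial derivatives. The value of g below is hypothetical.
import numpy as np

g = np.array([1.0, 0.0, 2.0])         # partials of some f at some point

M_euclid = np.eye(3)                  # standard scalar product
M_scaled = np.diag([1.0, 1.0, 4.0])   # <a,b> = a_x b_x + a_y b_y + 4 a_z b_z

grad_euclid = np.linalg.solve(M_euclid, g)   # = (1, 0, 2)
grad_scaled = np.linalg.solve(M_scaled, g)   # = (1, 0, 0.5)

# The direction of steepest ascent changes with the metric:
print(grad_euclid / np.linalg.norm(grad_euclid))
print(grad_scaled / np.linalg.norm(grad_scaled))
```

Note that the covector $df$ (the partials $g$) never changed; only its conversion into a vector did.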
The line "Then $D[\nabla f(x)]$ must be a $1\times n$ matrix" is where your confusion lies.
The derivative operator $D$ applied to a vector-valued function tells us how each component changes in each direction. Being more explicit with the notation, we have
$$\begin{align}\nabla f(\mathbf x) &= D[f (\mathbf x)]\\
&= \left(\frac{\partial f}{\partial x_1}, \ldots, \frac{\partial f}{\partial x_n}\right)\end{align}$$
Now think of applying $D$ to each element of this vector individually;
$$\begin{align}D[\nabla f(\mathbf x)] &= D[D[f(\mathbf x)]]\\
&=\left(D\left[\frac{\partial f}{\partial x_1}\right]^T, \ldots, D\left[\frac{\partial f}{\partial x_n}\right]^T\right)\end{align}$$
This expands to give us the Hessian matrix
$$D^2[f(\mathbf x)]=\left(\begin{matrix}\frac{\partial^2 f}{\partial x_1^2} & \ldots & \frac{\partial^2 f}{\partial x_1\partial x_n}\\
\vdots & \ddots & \vdots \\
\frac{\partial^2 f}{\partial x_n\partial x_1}& \ldots & \frac{\partial^2 f}{\partial x_n^2}\end{matrix}\right)$$
which is indeed $n\times n$.
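The $n\times n$ shape is easy to verify numerically. Below is a sketch that approximates $D^2[f(\mathbf x)]$ by central differences for a hypothetical test function $f(x_1,x_2) = x_1^2 + x_1 x_2$, whose Hessian is $\left(\begin{smallmatrix}2&1\\1&0\end{smallmatrix}\right)$:

```python
# Finite-difference approximation of the Hessian D^2[f(x)].
# The test function f is a hypothetical choice for illustration.
import numpy as np

def f(x):
    return x[0]**2 + x[0]*x[1]

def hessian(f, x, h=1e-5):
    """Approximate the n x n Hessian at x by central differences."""
    n = len(x)
    H = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            e_i = np.zeros(n); e_i[i] = h
            e_j = np.zeros(n); e_j[j] = h
            H[i, j] = (f(x + e_i + e_j) - f(x + e_i - e_j)
                       - f(x - e_i + e_j) + f(x - e_i - e_j)) / (4*h*h)
    return H

x0 = np.array([1.0, 2.0])
print(hessian(f, x0))  # ≈ [[2, 1], [1, 0]], an n x n (here 2 x 2) matrix
```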
The matrix derivative is just a particular case of the Fréchet derivative between two Banach spaces, which, by the way, is very similar in terms of its definition to the derivative of a function $f : \mathbb R^n \to \mathbb R$ provided in the question.
Applied to matrix derivatives, you just have to consider a map $f : V \to M$ where $M$ is a linear space of matrices endowed with the norm of your choice and $V$ is a Banach space that may or may not be finite-dimensional.
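As a standard worked example (not from the original question), take $f(X) = X^2$ on a space of square matrices. Expanding,
$$ f(X+H) = (X+H)^2 = X^2 + XH + HX + H^2, $$
and since $\|H^2\| \le \|H\|^2 = o(\|H\|)$, the Fréchet derivative is the linear map
$$ Df(X)[H] = XH + HX. $$
Note that $XH + HX \ne 2XH$ in general, because matrices do not commute; this is exactly the kind of phenomenon the Fréchet framework handles cleanly.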