How does a single $d$ in Leibniz notation work? Eg, $y\,\mathrm{d}x-x\,\mathrm{d}y\equiv-x^2\,\mathrm{d}\left(\frac{y}{x}\right)$

calculusderivativesnotation

Recently a lecturer used this notation, which I assume is a sort of twisted form of Leibniz notation:

$$y\,\mathrm{d}x – x\,\mathrm{d}y \equiv -x^2\,\mathrm{d}\left(\frac{y}{x}\right)$$

The logic here was that this could be used as:

$$\begin{align}
-x^2\,\mathrm{d}\left(\frac{y}{x}\right) &\equiv -x^2\,\left(\frac{\mathrm{d}y}{x} -\frac{y}{x^2}\,\mathrm{d}x\right)\\
&\equiv y\mathrm{d}x – x\mathrm{d}y
\end{align}
$$

Why is this legal?

I can see some kind of differentiation going on with the second term in the above equivalence, producing the $\frac{1}{x^2}$, but having the single $\mathrm{d}$ seems like a really weird abuse of notation, and I don't quite follow why it splits the single $\frac{y}{x}$ fraction into two parts.

Best Answer

You should know that the differential at a point $\mathbf x_0$ of a function $\;\mathbf R^m\longrightarrow \mathbf R^n$ is the linear map $\:\ell:\mathbf R^m\longrightarrow \mathbf R^n$, that yields the best linear approximation of $f(\mathbf x_0)$ in a neighbourhood of $\mathbf x_0$, in the sense that we have $$f(\mathbf x_0+\mathbf h)=f(\mathbf x_0)+\ell(\mathbf h)+o\bigl(\|\mathbf h\|\bigr).$$ This differential is denoted $\:\mathrm d f_{\mathbf x_0}$ (or simply $\mathrm df$ for the differential at a generic point). A linear function is of course its own differential.

With the usual abuse of language that denotes a function $f$ by its value at a given point, we obtain that the differential of the $i$-th projection $\:p_i:\mathbf x=(x_1,x_2,\dots,x_m)\longmapsto x_i$ is denoted $\mathrm dx_i$.

As an example, in the case of a function of a single variable $x$, the linear map defining the differential simply corresponds to the equation of the tangent to the graph of $f$ with abscissa $x_0$: $$f(x_0+h)=f(x_0)+f'(x_0)h,\enspace\text{i.e. }\quad \mathrm df_{x_0}:h\longmapsto f'(x_0)h,$$ that we may write as $\enspace\mathrm df=f'(x)\,\mathrm dx$. This notation is generalised to functions of $m$ variables under the form $$\mathrm df=\frac{\partial f}{\partial x_1}\,\mathrm dx_1+\frac{\partial f}{\partial x_2}\,\mathrm dx_2+\dots+\frac{\partial f}{\partial x_m}\,\mathrm dx_m. $$ The usual formulæ for the derivatives have a ‘differential version’:

  • $\mathrm d(f+g)=\mathrm df+\mathrm dg$,
  • $\mathrm d(fg)=f\,\mathrm dg+g\,\mathrm df$,
  • $\mathrm d\Bigl(\dfrac fg\Bigr)=\dfrac1{g^2}\bigl(g\,\mathrm df-f\,\mathrm dg\big),$
  • $\mathrm d(g\circ f)=\mathrm dg_{f(x)}\circ\mathrm df_x$.