Multivariable Chain Rule Formula doesn’t make sense to me

calculuschain rulederivativesmultivariable-calculuspartial derivative

Consider $z=f(x(t),y(t))$, then its chain rule formula is:

$\frac{dz}{dt} = \frac{\partial f}{\partial x} \frac{dx}{dt} + \frac{\partial f}{\partial y} \frac{dy}{dt}$

What I cannot make sense of, are the terms $\frac{\partial f}{\partial x}$ and $\frac{\partial f}{\partial y}$

$\frac{\partial f}{\partial x}$ is defined as the rate of change of $f$ with respect to $x$, when $y$ is held constant.

Now, in our original definition, both $x$ and $y$ depend on $t$. So if we hold $y$ constant, we'll have to hold $t$ constant, and thus $x$ will automatically be held constant. How, then, can we compute $\frac{\partial f}{\partial x}$ if we cannot hold only $y$ as constant?

Best Answer

What is confusing you is that the letters $x$ and $y$ are used for two purposes: (a) As names of the functions $x(t), y(t)$; (b) As names of the "first argument" and "second argument" of $f$. Sadly, this notation is very usual in mathematics, and there is no suitable alternative.

If I could convince to write partial derivatives differently, e.g. instead of $\frac{\partial f}{\partial x}$ you write $f_1'$ ("derivative on the first parameter"), and instead of $\frac{\partial f}{\partial y}$ you write $f_2'$ ("derivative on the second parameter"), then the formula becomes a lot more palatable:

$$z'(t)=\frac{d}{dt}f(x(t),y(t))=f_1'(x(t),y(t))x'(t)+f_2'(x(t),y(t))y'(t)$$

(I also write $x'(t)=\frac{dx}{dt}, y'(t)=\frac{dy}{dt}, z'(t)=\frac{dz}{dt}$, if that helps.)

Now it is obvious we are in no way modifying $x(t)$ or $y(t)$ when calculating $f_1'$ and $f_2'$. Instead, we are keeping constant the second argument of $f$/the first argument of $f$, respectively. After having calculated $f_1'$ and $f_2'$ - both end up as functions of two arguments - you will substitute those arguments with $x(t), y(t)$.

Sadly, this notation I used above is completely non-standard, and for historical reasons we are stuck with the notation that is confusing for you.

Best Answer

Related Solutions

[Math] Intuition of multivariable chain rule

[Math] How to apply the chain rule to double partial derivative of a multivariable function

Related Question