[Math] Calculus: why do we define rate of change as $dy/dx$

calculus

I'm just starting to learn Calculus using Morris Klines' awesome book, "Calculus, an intuitive and physical approach." I really like it so far.

I'm just at the beginning, and after learning how to differentiate I was wondering why rate of change is defined exactly the same for every function. Allow me to explain.

If we deal, for example, with functions the describe distance traveled over time, and we search for the exact speed at a specific time along this distance, then I totally understand why we define the rate of change as $\frac{dy}{dx}$ – it follows perfectly the physical way speed is defined and being calculated: speed=distance/time. Here, $dy$=difference in distance='a distance' and $dx$=difference in time='an amount of time'. So it makes sense to me. All that is left to do is make $dx$ (time) approach 0 and calculate the result.

My confusion comes when we deal with other kinds of physical quantities. Physical quantities whose very physical definition\calculation has nothing to do with division. As an example, let us view the area of a rectangle: $A=a*b$. Allow me to differentiate it, please, so you'll see what I mean.

(Do forgive me as I do not know how to write subscripts on this forum.)

Let us assume a is a constant and b is the independant variable. It follows then that for $b=b_1$ we get:

$a_1 = a\cdot b_1.$

$a_2 = a\cdot (b_1+db)=a\cdot b_1+a\cdot d_b.$

$da = a_2-a_1=a\cdot b_1+a\cdot db-a\cdot b_1 = a\cdot db.$

So far, so good. But then, in the book, for some odd and strange reason, we simpy divide both sides of the equation by $db$.

As mentioned above, the area of a rectangle is defined by MULTIPLYING two adjacent sides. It has nothing to do with division. So finding the rate of change of the rectangle area should also have nothing to do with divison. (In my opinion, of course, and I'll sooon explain why.)

You may tell me that 'the rate of change of the rectangles' area' is just half the sentence – it needs to be in relation to something – and that is where the divison comes from. When you look at the relation between two things mathematically – you divide them. Hence, relation is a quotient by definition. But I disagree, and here is why.

IMO, right where we stopped when we found the derivative of the rectangles' area IS the definition of 'rate of change in the area of the rectangle with relation to one of its sides' – it is right there in the last equation – it is the difference in areas ($dA$) between a rectangle whose side is $b_1$ and another whose side is slightly longer, $b_1+db$. I see here three variables: $dA$, $b_1$ and $db$. To me, that equation is also a mathematical relation between them that explains how they change with relation to one another.

Following this logic – all we need to do now is let db get smaller and smaller until it reaches $0$ to find the exact change in area at $b_1$. But when we do so, the entire right side equals $0$. (Which stands to reason, by the way, because what it actually means is that we subtract the areas of two identical rectangles – so it should indeed zero and cancel out.)

To me, this seems like the right way to calculate the rate of change IN THIS PARTICULAR SITUATION – an area of a rectangle with one side fixed as the other varies, as compared to dividing $dA$ by $db$.

It seems to me that at times, using $\frac{dy}{dx}$ really is the right and logical choice, and in others, we use it to kind of "cheat" because it is an algebraic trick that yields us a solution other than 0.

So, why is it that we define the rate of change EXACTLY the same for every function?

Best Answer

Besides thinking of derivatives as rates of change one can think about it as "the best linear approximation". Given any function $f $ depending on a variable $x $ we may inquire what the best linear approximation of the function around some fixed value $x_0$ is. The answer is that $f (x_0+\epsilon) \approx f (x_0)+\epsilon f'(x_0) $ where $f'(x_0) $ is the derrivative of $f $ evaluated at $x_0$, that is the slope of the function at that point. This interpretation generalizes easily to functions of several variables.

When thinking of rates of change, imagine a rectangle whose one side has a fixed length $l $ and the other depends on time. Suppose the other side depends on time via $b (t)=ct $ where $c $ has units of velocity, that is to say: The $b $ side of the rectangle moves with velocity $c $ making the area of the rectangle larger over time. The area as a function of time is $A (t)=lb (t)=lct $. The rate of change has units area/time. It is $\frac {dA}{dt}(t)=lc $. What this means is that if you have an area $A_0$ at some time $t_0$ and wait a very small amount of time $\delta t$, your area then increases to a very good approximation (which gets better the smaller $\delta t $ is) to become the value $A(t_0+\delta t)\approx A_0+ \delta t\cdot \frac {dA}{dt}(t_0)$.

Surely you find this intuitive as it is basically the same as velocity. The above example justifies the identification of "absolute change of a function due to small change of the independant variable" and "rate of change times small change of independent variable". These two things are almost equal and the difference between them becomes smaller if we make the change in the variable smaller. The diference is also small if the function "looks linear" at the initial value of the variable as opposed to "fluctuating vildly". In fact the area of a rectangle is a linear function of obe side's length and the approximation is exactly true in this case. This is the same as "the rate of change being constant", or equivalently "the acceleration being zero".

Now consider what it means to say how much the area of a rectangle changes if we change one of the sides a little bit. The initial area being $A_0$ and increasing one side by $\delta b $ the area increases by a small rectangle $\delta b\cdot l $. Compare the total area after the increase $A (b+\delta b)=A_0+ \delta b\cdot l=A (b)+\delta b \cdot \frac {dA}{db}(b) $ to the formulas above and maybe you will be convinced that the definition of rate of change as a ratio is correct. The approximation is exactly true here because area is a linear function as discussed above.

Related Question