[Physics] Another Solution To Brachistochrone Problem

brachistochrone-problemvariational-principle

Recalling the statement of the problem :

Given two points A and B in a vertical plane, what is the curve traced out by a point acted on only by gravity, which starts at A and reaches B in the shortest time.

And as we can show with the method of variational calculus that the curve is turn out to be cycloid (Figure-A).

$$x=r(\phi-\sin\phi)$$
$$y=r(1-\cos\phi)$$

where $\phi$ is a real parameter, corresponding to the angle through which the rolling circle has rotated. For given $\phi$, the circle's center lies at $(x, y) = (r\phi, r)$

In the brachistochrone problem, the motion of the body is given by the time evolution of the parameter:
$$\phi(t)=\omega t, \ \ \omega =\sqrt{\frac{g}{r}}$$


Now, look at the angle, call $\theta$, that the velocity vector makes with the vertical (Figure-B).

$$\tan(\theta)=\frac{\dot{x}}{\dot{y}}=\tan\frac{\phi}{2}$$
or $$\theta \propto t$$

In other words, In the $\theta-t$ plane, the curve is just a straight line. So What's the intuitive reason that the curve which minimizes the time should be a straight line curve in the $\theta-t$ plane?


enter image description here

enter image description here

Best Answer

A good question! This is best answered by making use of intrinsic coordinates. The reason for this is twofold: firstly, as you have very nearly found, the equation of a cycloid is particularly simple when written in intrinsic coordinates; and secondly, the variational equation for the curve also takes on a a particularly simple form in these coordinates. I will explain both of these points now.

Intrinsic coordinates

In any curvilinear coordinate system , a curve is described by specifying the values of the coordinates as functions of each other e.g. $y=f(x)$, or as a functions of some parameter, e.g. $y=y(t), x=x(t)$. Here, each coordinate is understood to be defined relative to some 'grid'. An alternative, and somewhat esoteric, way to describe a curve is to specify its arc length, $s$, and its angle of inclination (relative to some chosen direction in space), $\psi$. This does not require reference to any fixed grid of coordinates, and instead uses the 'intrinsic' properties of the curve itself.

Intrinsic coordinates

(N.B. this image shows the y axis pointing up - we will take it to be pointing down instead.)

Cartesian coordinates and intrinsic coordinates are related by $$ \frac{\text{d}y}{\text{d}s} = \sin\psi, \qquad \frac{\text{d}x}{\text{d}s} = \cos\psi $$ These awkward equations are part of the reason that intrinsic coordinates are so rarely used. enter image description here

(I haven't quite figured out Geogebra yet!)

As you have seen, for a cycloid described by parametric equations $$ x(\phi) = r(\phi-\sin\phi), \qquad y(\phi) = r(1-\cos\phi) $$ we get $$ x' = r(1-\cos\phi), \qquad y' = r\sin\phi $$ so that $$ \frac{\text{d}y}{\text{d}\phi} \frac{\text{d}\phi}{\text{d}x}=\frac{r\sin\phi}{r(1-\cos\phi)} = \cot\frac{\phi}{2}= \frac{\text{d}y}{\text{d}s}\frac{\text{d}s}{\text{d}x} = \tan\psi \implies \psi = \frac{\pi}{2}-\frac{\phi}{2}. $$ We also have $$ s(\phi) = \int_0^\phi \text{d}\phi'\sqrt{x'^2+y'^2} = r^2\int_0^\phi \text{d}\phi' \sqrt{2-2\cos\phi'}=4r-2r\sqrt{2}\sqrt{1-\cos\phi}\cot\frac{\phi}{2}. $$ so that, combining the two, we get the very simple $$ s(\psi) = 4r(1-\sin\psi). $$ This is the first part done - we have used intrinsic coordinates to show how such a simple expression arises. I will now demonstrate why it is so simple in these coordinates.

Equations of motion in intrinsic coordinates

We now consider the velocity of a particle with position $\mathbf{r}(t)$, and rewrite it in terms of the arc length: $$ \mathbf{v}(t) = \frac{\text{d}\mathbf{r}}{\text{d}t} = \frac{\text{d}\mathbf{r}}{\text{d}s}\frac{\text{d}s}{\text{d}t} = \hat{\boldsymbol{\tau}}\dot{s}, $$ where $\hat{\boldsymbol{\tau}} = \text{d}\mathbf{r}/\text{d}s$ is a unit vector tangent to the path of the particle, and $\text{d}s/\text{d}t=v$ is its speed. Now, using the Frenet-Serret formulas, the time derivative of $\hat{\boldsymbol{\tau}}$ is $$ \frac{\text{d}\hat{\boldsymbol{\tau}}}{\text{d}t} =\frac{\text{d}\hat{\boldsymbol{\tau}}}{\text{d}s}\frac{\text{d}s}{\text{d}t}= \frac{1}{\rho} \hat{\mathbf{n}} \dot{s} $$ where $\hat{\mathbf{n}}$ is a unit vector orthogonal to $\hat{\boldsymbol{\tau}}$. This means that the acceleration of the particle is $$ \mathbf{a} =\frac{\text{d}\mathbf{v}}{\text{d}t} = \frac{\dot{s}^2}{\rho}\hat{\mathbf{n}}+\ddot{s}\hat{\boldsymbol{\tau}}. $$ If the particle moves under gravity, then the equation of motion is therefore $$ m\frac{\dot{s}^2}{\rho}\hat{\mathbf{n}}+m\ddot{s}\hat{\boldsymbol{\tau}} = mg\hat{\mathbf{y}}, $$ where $\hat{\mathbf{y}}$ is a unit vector in the $y$ direction. Taking the dot product with $\hat{\boldsymbol{\tau}}$, and using $\hat{\boldsymbol{\tau}}\cdot\hat{\mathbf{y}} = \sin\phi$, we get the equation of motion for the speed of the particle: $$ \ddot{s} = g\sin\psi.\tag{2} $$ This equation is very important. Since for the Brachistochrone we have the initial conditions $s(0)=\dot{s}(0)=0$, the above equation means that the form of $s(t)$ is entirely determined by $\psi(t)$. In other words, the function $\psi(t)$ entirely determines the shape of the curve.

Functional minimisation

Finally, consider the functional to be minimised for the Brachistochrone problem: the total time $T$ to traverse a curve $C$ under the influence of gravity is $$ T = \int_C \text{d}t $$ As it stands, the integrand is not in a form that allows the functional to be varied: the path dependence of $T$ is not explicit. In order to apply the Euler-Lagrange equations, it is necessary to rewrite the integral to be over a fixed domain. Two common ways to do this are $$ T = \int_C \frac{\text{d}s}{\dot{s}} = \int \sqrt{\frac{1+y'(x)^2}{2gy(x)}}\text{d}x $$ and $$ T = \int \sqrt{\frac{\dot{x}(t)^2+\dot{y}(t)^2}{2gy(t)}}\text{d}t. $$ This works because $y(x)$, or $x(t)$ and $y(t)$ together are sufficient to specify the shape of the curve. This is true more generally: any form of the integrand (over a fixed domain) that will allow the Euler Lagrange equations to return a complete description of the curve is sufficient. But we have just found a single function that does just that, namely $\psi(t)$! Assuming $\psi(t)$ is invertible (which it is over the domain we are interested in) we are therefore allowed to rewrite the integrand as $$ T = \int_C \text{d}t = \int\frac{\text{d}t}{\text{d}\psi} \text{d}\psi. \tag{3} $$ Applying the E-L equations $$ \frac{\text{d}}{\text{d}\psi}\frac{\partial\mathcal{L}}{\partial t'} = \frac{\partial\mathcal{L}}{\partial t} $$ with $\mathcal{L} = \frac{\text{d}t}{\text{d}\psi} = t'$, we find $$ \frac{\text{d}}{\text{d}\psi} t' =0 \implies t' = \text{const} \implies t=A\psi+B \text{ i.e. } \psi(t) = at+b $$ This is exactly what you found before. As I will show below, the parameter $a$ is your $-\omega/2$. (I am stuck on one final detail here - clearly we need $b=\pi/2$, but I can't see a simple reason for this).

To reiterate, it is only possible to write equation (3) because $s(t)$ is entirely determined by $\psi(t)$ by equation (2). This is the reason we are not allowed to write e.g. $T= \int(\text{d}t/\text{d}x) \text{d}x$ - because the function $x(t)$ alone is not sufficient to describe the curve - $y(t)$ is also required.

Finally, setting $b=\pi/2$ and $a=-\omega/2$ integrating the equation of motion (2), we find $$ \frac{\text{d}^2 s}{\text{d}t^2} = g\sin\left(\frac{\pi}{2}-\frac{\omega}{2} t\right) \implies s(t) = -\frac{4g}{\omega^2}\sin\left(\frac{\pi}{2}-\frac{\omega}{2} t\right) +ct+d. $$ With $s(0) = \dot{s}(0) = 0$, we find $c=0, d = 4g/\omega^2$ so that $$ s(\psi) = \frac{4g}{\omega^2}(1- \sin\psi) $$ as before.


References:

  1. https://en.wikipedia.org/wiki/Curvilinear_coordinates
  2. https://en.wikipedia.org/wiki/Frenet%E2%80%93Serret_formulas
  3. https://ocw.mit.edu/courses/aeronautics-and-astronautics/16-07-dynamics-fall-2009/lecture-notes/MIT16_07F09_Lec06.pdf
Related Question