[Physics] What are waves? Where does the wave equation come from

waves

I'm taking a course on waves and optics using Young and Freedman's University Physics, but I'm a bit confused about a couple of things. I've also looked at Griffiths' Introduction to Electrodynamics and Taylor's Classical Mechanics, and the three of them seem to say the following:

Young and Freedman

  • A wave is (roughly) a disturbance of a system which can propagate from one region of the system to another.
  • Derives the wave function $y(x,t) = A \cos(kx – \omega t)$ for a sinusoidal wave on a string.
  • Uses that to derive the wave equation $\frac{\partial^2 y}{\partial x^2} = \frac{1}{v^2} \frac{\partial^2 y}{\partial t^2}$, where $v$ is the phase velocity.
  • Uses Newton's second law to derive the wave equation for a general wave on a string, this time on the form $\frac{\partial^2 y}{\partial x^2} = \frac{\mu}{F} \frac{\partial^2 y}{\partial t^2}$.
  • Compares the two and concludes firstly that $v = \sqrt{F/\mu}$ for a wave on a string, and secondly that the wave equation with coefficient $1/v^2$ is valid for any wave on a string.

That last step confuses me and doesn't seem to follow from the above. The wave equation with coefficient $1/v^2$ has only been shown to be valid for sinusoidal waves, so how can they conclude that the coefficient has that form for all waves on a string?

Griffiths

  • Recognises the intrinsic vagueness of the concept but provides a tentative description (definition?): "A wave is a disturbance of a continuous medium that propagates with a fixed shape at a constant velocity."

  • Argues that a general wave function has the form $f(z,t) = g(z – vt)$ for any function $g$. (I find this pretty satisfactory.)

  • Derives the wave equation $\frac{\partial^2 f}{\partial z^2} = \frac{\mu}{T} \frac{\partial^2 f}{\partial t^2}$ for a general wave on a string in a similar manner to Young and Freedman. Rewrites the coefficient as $1/v^2$ with $v = \sqrt{T/\mu}$ but doesn't simply state that $v$ is the phase velocity.

  • Starts with the wave function $f(z,t) = g(z – vt)$ and derives the wave equation on the form $\frac{\partial^2 f}{\partial z^2} = \frac{1}{v^2} \frac{\partial^2 f}{\partial t^2}$.

  • Concludes that $v$ is the phase velocity for any wave.

  • Mentions (but does not show) that the general solution is $f(z,t) = g(z – vt) + h(z + vt)$ for some function $h$.

This seems more satisfactory. But something still confuses me: Why start with a general wave on a string? Is it simply for pedagogical reasons, or is it not sufficient to start with the wave function $f(z,t) = g(z – vt)$ and derive the wave equation from that?

Taylor

  • Also starts with a general wave on a string and derives $\frac{\partial^2 u}{\partial x^2} = \frac{1}{c^2} \frac{\partial^2 u}{\partial t^2}$ with $c = \sqrt{T/\mu}$, basically the same as Griffiths I think. He claims that $c$ "is the speed with which the waves travel" without arguing for it yet. (I find him rather vague here. It's a bit unclear exactly what kind of claim this is, whether he thinks it follows from the derivation or not, whether he wants the reader to take it for granted, or if he's going to return to it or not.)

  • Shows that the general solution has the form $u(x,t) = f(x – ct) + g(x + ct)$ for any function $f$ and $g$.

  • Considers the case where $u(x,t) = f(x – ct)$, interprets this solution physically and sees that it describes a disturbance travelling in the positive $x$-direction at speed $c$. I suppose he has now shown that $c$ is the phase velocity.

Griffiths and Taylor seem to do things in opposite directions: Griffiths starts with the wave function and derives the wave equation, and Taylor does the opposite. This leaves me confused as to what a wave is mathematically, and what relation the mathematical description has to the physical phenomenon.

That is, how is a wave defined? Is it a mathematical model which, by construction, fits many physical phenomena (and thus a wave is simply defined as a disturbance that has a wave function or is a solution to the wave equation – and if so, which of the two, function or equation, or are they equivalent?)? Or is it a more or less well-defined class of physical phenomena that just happens to be described by the wave equation (maybe not always)? In that case, what authority does the wave equation have?

Griffiths later derives differential equations for the $\mathbf{E}$ and $\mathbf{B}$ fields in empty space, compares them to the three-dimensional wave equation and concludes that electromagnetic waves exist. If this similarity to the wave equation shows and not just suggests that EM waves exist, then it seems to follow that the wave equation has some authority in determining what counts as a wave. He uses the word "imply", but I don't know what that means.

Best Answer

Mathematically

The waves are solutions of the wave equation: $$\Delta f - \frac{1}{v^2}\frac{\partial^2 f}{\partial t^2} = 0$$

This equation can be solved by many tools. The most elegant method is probably Fourier transformation; it allows us to separate the solution in coordinates (that's useful in some physics applications). The solution you mentioned $f = g(z-v/t) + h(z+v/t)$ is only one solution, but it doesn't cover the whole picture (understand the whole space of solutions).

Physically

We can find wave equation in few basics examples. The first one could be the string vibrating for small initial deviations. The other wave equation can be found in Maxwell equation for field $\vec E, \vec B$ or for scalar and vector potentials $\phi, \vec A$. It only means that these waves are physical, but these waves must still satisfy Maxwell equations.

In physics, this is only a special case of what we call waves. We have Klein-Gordon waves, Dirac waves (you will learn this in quantum field theory) or very simple equation from electrodynamics: waves in conductors. All these waves don't satisfy the wave equation in the classical meaning of \eqref{A}. But we can still call it waves.

So mathematically waves are strictly solutions of the wave equation \eqref{A}, physically we call disturbance in the space that are time-dependent or physical entities carrying information that is propagating from one place to another.

The word "wave" has its origin. Mathematicians in history (in post-renessaince era mostly) were working on the description of the musical instruments. So the first origin of waves was from mathematicians studying physical nature. I can recommend you this article [*].

Edit:

When Maxwell equations are reduced into wave equations, it really means that electromagnetic waves exists. That is because of existence of $\vec E, \vec B$. These fields exists, they carry momentum and energy. So if they are solution of the wave equation, they must be waves.

This led many of physicists to think that there is an aether, which is a medium where electromagnetic waves (as light) can be disturbance. And this is the problem known in special relativity when Michelson and Moorley proved that aether (even if it exists) is not important for electromagnetic waves. This experiment was proof for Einstein special relativity and it started the modern era of the physics.

[*] https://www.jstor.org/stable/41134001?seq=1#page_scan_tab_contents