[Physics] Special Relativity: Transforming Maxwell’s equations

maxwell-equationsspecial-relativity

I'm working through Einstein's original 1905 paper*, and I'm having trouble with the section on the transformation of Maxwell's equations from rest to moving frame.

The paper proceeds as follows:

Let the Maxwell-Hertz equations for empty space hold good for the stationary system K, so that we have

IMAGE: Untransformed Maxwell equations

where (X, Y, Z) denotes the vector of the electric force, and (L, M, N) that of the magnetic force.

If we apply to these equations the transformation developed in § 3, by referring the electromagnetic processes to the system of co-ordinates there introduced, moving with the velocity v, we obtain the equations

IMAGE: Transformed Maxwell equations

I do not understand just where this second set of equations comes from.

The only derivations of this transformation that I have been able to find involve either potentials, four-vectors, or both. However since four-vectors had not been invented in 1905, and because the statement of the transformation is so blunt, it seems that Einstein is using or appealing to a simpler method for finding the transformation.

So my question is: What is the simplest derivation of the transformation rules for Maxwell's equations in special relativity? Is Einstein appealing to a pre-existing Lorentz method here, or is there a trick to accomplish all of this quickly? Or does the text here simply belie the true work involved in the derivation?

  • I cannot give a proper link to the paper owing to link restrictions, but it's at hxxp://www.fourmilab.ch/etexts/einstein/specrel/www/#SECTION21

Best Answer

There exists a remarkably simple explanation for how electromagnetic phenomena must be transformed in a moving frame if: (1) the physics of that frame is to remain invariant; and (2) the speed of light c must remain invariant for both the at-rest observer and within the moving frame. These are Einstein's original two SR postulates, of course. (There is actually a third assumption needed about dimensional scaling orthogonal to the direction of motion. Lorentz discussed that issue explicitly, but Einstein seems to assumed as a given.)

The technique is to use light pendulums to define proper time and length in both frames, then show geometrically how this affects electromagnetics phenomena. I've only used it myself to bring out the energy relations, which is part of Einstein's subsequent very short paper that led to $E=mc^2$ (what a delightfully strange technique that paper uses!). All of the Maxwell transformations are necessarily implicit those energy transformations of moving light pendulums. They have to be, else SR would not work.

I have some light pendulum diagrams on hand that I had intended for use in time dilation discussions. I'll append them to this answer shortly, and explain at least briefly how they can be used to re-interpret electromagnetics and Maxwell's equations. I don't have time for a full treatment (flying tomorrow), but the light pendulum perspective is both helpful and deeply linked to Einstein's very first (and also much later) explanations of why special relativity is an unavoidable consequence of holding c invariant in all frames of motion.


So, the promised Graphical Overkill ensues below. I don't even have time tonight to explain the graphics, but I to promise I'll get back to them.

The initial foray into EM implications is, alas, the very last bullet in the very last slide. But it does show how transformations of both wavelength and energy are inherent and unavoidable within the curious constraints of having to preserve the physics and speed of light in both the frame doing the observing, and in the frame being observed.


3D Light Pendulum, Outbound Phase


3D Light Pendulum, Inbound Phase


2D Light Pendulum


1D Light Pendulum


1D Rho Clock


A Human-Scale Rho Clock


This final graphic is at the very heart of how Einstein first came up with the theory of special relativity. He postulated two things: (1) that the speed of light remains invariant in all unaccelerated frames of reference, and (2) that all physics, including not just mechanics (Galilean relativity) but also electromagnetics, must remain unchanged regardless of frame.

Oh my... what those two postulates do in combination!

Light pendulums give a nicely succinct way of exploring all of those implications using a single device. The rho clocks I use here are simply light pendulums with enough design constraints and frame-specific labels attached to allow those implications to be explored.

So, take a look at this final figure:

When Light Pendulums Move

Notice that the sides of the gray rectangles represent light paths drawn out in spacetime. So, if the speed of light is invariant, guess what? An object that is moving cannot have the shortest possible inbound an outbound light paths, because the light has to constantly pace the object to keep up with it. If the object moves very, very fast, the time it takes for the outbound light to catch up with its leading edge can become very long indeed, as viewed by an observer the "rest" frame labeled 0 ($\phi_0$).

How long? Well, look at the figure: It's just the height of the scaled gray box, which is what the rho clock ($\rho_1$) for frame 1 ($\phi_1$) looks like when viewed from the rest frame $\phi_0$. For shorthand, I call that situation $\rho_{1:>0}$, where "1:>0" just means that frame 1 is being observed by frame 0 (the ":>" is two eyes looking left). And yes, the label on that height is the traditional $\gamma$ of special relativity. The idea of $\gamma$ just emerges a lot more naturally (and a lot more geometrically) in rho diagrams.

As for electromagnetics, recall that both frames must see their physics unchanged. But notice how the pulses of blue light on the left side of the gray rectangle get crowded together to make that true. That doesn't happen just for pulse spacing, it happens for all of electromagnetic theory. So, for example, light traveling along the left branch of the frame 1 rho rectangle ("rho" actually stands for rectangle, not relativity) must increase in frequency to keep the internal physics of frame 1 rho clock unchanged (shorthand: $\rho_{1:>1}$ must remain invariant).

That has huge energy implications over in frame 0, however, since the faster the object travels, the more energetic that side of the rho clock rectangle becomes. If you combine that with some very unusual arguments from Einstein in his second special relativity paper (I like to call it his "asymptotic tautology argument"), you wind up eventually with the famous equation $E=mc^2$. (Side comment: That equation is more famous, but $E^2 = (pc)^2 + (mc^2)^2$ is a lot more useful; just ask any particle physicist.)

The nice thing about Maxwell's equations is that they already accommodated and allowed such unusual forms of scaling long before Einstein and special relativity came around. That is one of the reason why you will see almost breathless praise for Maxwell from Einstein and other physicists involved in the early days of special relativity. The new theory helped them appreciate in new ways just how deep Maxwell's insights had been.

Related Question