[Math] the exact motivation for the Minkowski metric

differential-geometrysemi-riemannian-geometrysoft-question

In introductory texts about Lorentz Geometry, one always learns about the Minkowski space, i.e. $R^4$ with the Minkowski metric

$$ m(x, y) := -x_0 y_0 + x_1y_1 + x_2y_2+ x_3 y_3 $$

Using this metric, one can define lightcones $\{x \in R^4 \mid m(x, x) = 0 \}$ and timecones $\{x \in R^4 \mid m(x, x) < 0\}$. The existence of these cones is then physically interpreted as the fact that no particle can move faster than light.

But – thinking the other way around – why exactly do I have to use the Minkowski metric to formalize this fact? These cones can be defined also without reference to any metric. I could use $R^4$ with Euclidean metric as my mathematical model of reality. The fact that no particle moves faster than a certain bound is then modelled by the fact that the tangent vectors of curves are within these cones.

Is there an instructive way to see why one has to take the Minkowski metric on $R^4$?

Idea: Maybe the Euclidean metric conflicts with the fact that the speed of light is the same in any reference frame? But how to make this formally precise? How to define speed then?

Best Answer

Consider a 2d world. There are basically three useful cases to consider: Lorentzian (-,+) signature, Galilean (0, +) signature, and Euclidean (+, +) signature. [Yes, perhaps there is a (0,0) signature, but this is especially boring.]

Each of these spaces has a "rotation-like" operation, generated by exponentials of various quantities that are not real numbers but are geometrically significant to the space.

In Euclidean space, that quantity is something like $i$ in that it squares to $-1$ and thus, the rotation-like operation is rotation and the exponential is trigonometric in nature.

In Galilean space, that quantity is instead some $\epsilon$ such that $\epsilon^2 = 0$, and the exponential power series degenerates to $\exp(\epsilon) = 1 + \epsilon$. This generates the simple velocity-addition law of Galilean relativity.

In Lorentzian space(time), the fundamental quantity is some $j$ such that $j^2 =1 $. This is how the exponential generates hyperbolic functions; the rotation-like operation is Lorentz boosting.

What is unique to Lorentzian space is the presence of non-zero vectors that are unaffected by rotation-like operations in planes they occupy. Every vector in Euclidean space is rotated by some angle as long as it lies in the rotation plane. The same goes for Galilean space. But Lorentzian space has the lightlike or null vectors, which go along light cones, which are totally unaffected by boosts.

This is just part of what makes Lorentzian space(time) an attractive model for the physical world; it reduces the constancy of the speed of light to a purely geometric phenomenon. We need not ask ourselves why some object with a given speed should be perceived as having the same speed regardless of our orientation (reference frame). Choosing Lorentzian space(time) as our model makes the existence of such directions of travel patently obvious. The question then becomes why light should travel along those directions, which opens up new avenues of exploration for physics.

Related Question