[Physics] Which global symmetry of Minkowski space (if any) gets gauged to the diffeomorphism invariance of general relativity

diffeomorphism-invariancegauge-theorygeneral-relativitylorentz-symmetry

Minkowski space has both translational and Lorentz symmetry, which together give Poincare symmetry. (It also has some discrete symmetries like parity and time-reversal that I won't be concerned with.) In some senses, it seems natural to think of the diffeomorphism invariance/general covariance of general relativity as the "gauged" version of some of these symmetries. But which ones?

1) The equivalence principle is often stated as "spacetime always looks locally like Minkowski space," or "the value of a scalar contracted from Lorentz-covariant tensors at the same point in spacetime is coordinate-invariant," or something along those lines. It seems to me that if you only look at an infinitesimal patch of spacetime, then you can't really talk about translational invariance (which would move you outside the patch), so the symmetry group of that tiny region of spacetime should be thought of as the Lorentz group rather than the Poincare group. If Lorentz symmetry now holds locally at each point in spacetime, then you can say that we have "gauged" the Lorentz group.

2) On the other hand, the "conserved current" in GR is the local conservation of the stress-energy tensor $\nabla_\mu T^{\mu \nu} = 0$.

(I know this will unleash a torrent of commentary about whether GR is a gauge theory, and Noether's first vs. second theorem, and conservation laws that are mathematical identities vs. those only hold under the equations of motion, and so on. Those question have all been beaten to death on this site already, and I don't want to open up that Pandora's box. Let's just say that there is a formal similarity between $J^\mu$ in E&M and $T_{\mu \nu}$ in GR, in that their conservation is trivially true under the equations of motion $\partial_\nu F^{\mu \nu} = J^\mu$ and $G_{\mu \nu} \propto T_{\mu \nu}$ and leave it at that.)

But this is just the diffeomorphism-covariant version of the result $\partial_\mu T^{\mu \nu} = 0$ in Minkowski-space field theory, which is the Noether current corresponding to translational symmetry (As opposed to generalized angular momentum, which corresponds to Lorentz symmetry.) This seems to imply that the natural interpretation of diffeomorphism invariance is as the gauged version of translational symmetry.

Is it more natural to think of diffeomorphism invariance (or the general covariance of GR, which depending on your definitions may or may not be the same thing) as the gauged version of (a) Lorentz symmetry or (b) translational symmetry? Or (c) both (i.e. Poincare symmetry)? Or (d) neither, and these are just vague analogies that can't be made rigorous? If (a) or (b), then why does only a proper subgroup of the Poincare group get gauged? And if (c), then why does only the translational part of the gauge group seem to correspond to a conserved current?

(BTW, I'm looking for a high-level, conceptual answer, rather that one with a lot of math jargon.)

Best Answer

First, general relativity is not a gauge theory in the narrow sense (of having a gauge field) if you consider the second-order formalism in which only the metric is dynamical. The Einstein-Hilbert action conceived of as an action where the only dynamical field is $g$ still has spacetime dependent symmetries ($\mathrm{GL}(n)$-valued transformations acting like the Jacobians of diffeomorphisms on all fields), so it has gauge symmetries and consequently gauge freedom (e.g. the one used below in the spin connection formalism to "diagonalize the metric" at every point), but it does not have a dynamical gauge field. However, there are (at least) two ways to formulate the theory of the Einstein-Hilbert action in terms of a gauge field:

General relativity is a gauge theory with either the general linear group $\mathrm{GL}(n)$ or the Lorentz group $\mathrm{SO}(n-1,1)$ playing the role of the gauge group, depending on your formulation, if you're willing to relax the usually strict requirement that only gauge-invariant quantities are physically meaningful - while Lorentz invariant quantities are more useful in generic computations than, say, vectors, no one claims you can't measure a vector in a given frame. Additionally, GR is not a "free" gauge theory (in the sense of Yang-Mills or Chern-Simons) coupled to something, the gauge field is never the sole dynamical variable, but always coupled to either the metric or the vielbein, so there's another sense in which it doesn't conform to our usual notion of gauge theory.

The two formulations are as follows:

  1. Classical (Palatini) formalism: In the first-order formulation (Palatini formalism, so also this question) of GR, the dynamical fields are the metric and the Christoffel symbols. Examining the transformation behaviour of the Christoffels (as I do in this answer), it is straightforward to see that they transform precisely like a $\mathrm{GL}(n)$-gauge field. It is rather crucial to note that diffeomorphism invariance is not the same as gauged $\mathrm{GL}(n)$-invariance - the former is a basic aspect of all "coordinate-invariant physics", while the latter essentially arises because the Ricci scalar in the Einstein-Hilbert action is analogous to the gauge-invariant $\mathrm{Tr}(F)$ terms in ordinary gauge theories. Yes, this is often claimed otherwise, and yes, I am sure that diffeomorphisms are not gauged versions of anything. However, diffeomorphisms induce $\mathrm{GL}(n)$ gauge transformations through their Jacobians, see again the answer about the transformation behaviour of the Christoffels I linked above.

  2. Spin connection formalism: Instead of conceiving of the tangent bundle as associated to a $\mathrm{GL}(n)$-frame bundle, a manifold of signature $p,q$ has naturally a reduction of the frame bundle to a $\mathrm{SO}(p,q)$ frame bundle, which you may think of as just the bundle of all orthonormal bases relative to the given metric of signature $p,q$, whereas the $\mathrm{GL}(n)$ bundle is the bundle of all bases. The physicist knows this reduction as the tetrad or vielbein formalism, and it allows us to reduce the $\mathfrak{gl}(n)$-valued gauge field $\Gamma$ that is the Christoffels to a $\mathfrak{so}(p,q)$-valued gauge field that is the spin connection $\omega$ essentially by a smooth choice of orthonormal (non-coordinate) basis all over spacetime, which I explain in a bit more detail in this answer. The dynamical fields in the spin connection formalism are the spin connection and the vielbein.

As supplementary evidence that the slogan that "diffeomorphism invariance is a gauge invariance" is false, I urge you to consider that ordinary Yang-Mills theory is also perfectly "diffeomorphism invariant": The Yang-Mills action $$ \int_M \mathrm{tr}(F\wedge{\star}F)$$ has no dependence on coordinates whatsoever either, it is not more or less "diffeomorphism invariant" than the Einstein-Hilbert action is. The significance of "diffeomorphism invariance" in GR is really much more that, as I said above, the Jacobians of diffeomorphisms are the natural source for the gauge transformations of the Christoffels, and that the theory would also be separately invariant just under the $\mathrm{GL}(n)$ transformations without considering an underlying diffeomorphism.

Related Question