[Physics] How does the Ward-Takahashi Identity imply that non-transverse photons are unphysical in QED

gauge-theoryquantum-electrodynamicsrenormalizationunitarityward-identity

Peskin and Schroeder say that the Ward Identity of QED proves that non-transverse photon polarizations can be consistently ignored, but I'm confused about the details.

Setup

One starts by considering some process with an external photon whose momentum is chosen to be $k^\mu=(k,0,0,k)$ and let the two transverse polarization vectors be $\epsilon_1^\mu=(0,1,0,0)$ and $\epsilon_2^\mu=(0,0,1,0)$. The Ward identity tell us that if the amplitude for the process is $\mathcal{M}=\mathcal{M}^\mu\epsilon_\mu(k)$, where we've factored out the polarization vector for the external photon in consideration, then the amplitude obeys $\mathcal{M}^\mu k_\mu=0$, on shell. With our setup, this simply tells us $\mathcal{M}^0=\mathcal{M}^3$. If we then calculate the square of the amplitude and sum over external polarizations we'd find $|\mathcal{M}|^2=\sum_{i\in\{1,2\}}\epsilon_{i\mu}\epsilon_{i\nu}^*\mathcal{M}^\mu\mathcal{M}^{*\nu}=|\mathcal{M}^1|^2+|\mathcal{M}^2|^2$. Due to the Ward identity, this is equal to $-\eta_{\mu\nu}\mathcal{M}^\mu\mathcal{M}^{*\nu}$ and so we can make the replacement $\sum_{i\in\{1,2\}}\epsilon_{i\mu}\epsilon_{i\nu}\to -\eta_{\mu\nu}$. Peskin and Schroeder claim (pg 160-161) that this is proof that non-transverse photons can be consistently ignored.

Multiple Questions

1) P&S appear to be claiming that had we also summed over non-transverse polarizations we would find that the polarization sum just turns into $-\eta_{\mu\nu}$. However, if I use the two vectors $\alpha_1^\mu=(1,0,0,0)$ and $\alpha_2^\mu=(0,0,0,1)$ as the basis for the non-transverse polarizations then it would appear that the polarization sum would turn into $\delta_{\mu\nu}$ rather than $-\eta_{\mu\nu}$. How does one know that there should be an extra minus sign so that $|\mathcal{M}^0|^2$ comes in with a minus sign relative to all the other squared amplitudes? That is, I'd somehow know that the proper calculation is given by $\sum_{\rm all\ polarizations}|\mathcal M|^2=\mathcal{M}^\mu\mathcal{M}^{*\nu}(\epsilon_{1\mu}\epsilon^*_{1\nu}+\epsilon_{2\mu}\epsilon_{2\nu}^*-\alpha_{1\mu}\alpha_{1\nu}^*+\alpha_{2\mu}\alpha_{2\nu}^*)$, but I don't see where that minus sign would arise from. One could guess that you'd have to end up with something proportional to $\eta_{\mu\nu}$ due to Lorentz-Invariance, but that's not very satisfactory. Finally, the polarization sum is naively a sum of manifestly positive numbers, yet the P&S argument depends on some kind of cancellation between these number, so how is this possible?

2) What about scenarios where one doesn't perform the polarization sum? I thought that the polarization sum was only performed when the detector is insensitive to polarization, which is not always the case at hand. I would have thought that one could show that non-transverse polarization are unphysical without having to do a polarization sum. For example, given $\mathcal{M}^\mu$ I would have thought that we could have contracted this with one of the non-traverse polarization vectors, say $\alpha_2^\mu$, and we should find that the amplitude for this process vanishes by itself.

3) Shouldn't we be showing that we can also ignore non-transverse states in all parts of the diagram, not just on external legs? If non-transverse states can run in loops then they need to be included in physical initial and final states, due to the optical theorem, which needs to be avoided. Or are P&S claiming that they've proved that they've shown that non-transverse polarizations can be ignored in the initial and final states, so by the optical theorem they can be ignored in loops, too?

Best Answer

(1) The completeness relationship for a basis of vectors orthonormal with respect to $\eta_{\mu\nu}$ is \begin{equation} \eta_{ij}\epsilon^{(i)}_\mu \epsilon^{(j)}_\nu = \eta_{\mu\nu} \end{equation} This normalization convention is picked for Lorentz invariance... I know you said you didn't want that answer but the point is that the normalization of these vectors is a matter of convention and it's best to pick a Lorentz invariant one. One advantage of choosing a L.I. normalization is that we don't need to specify the argument: the $\epsilon$ depend on the momentum, but these normalization conditions do not. The $\eta_{ij}$ provides the minus sign you are missing. Also here you see the basic problem that the gauge symmetry fixes: one of the polarization vectors necessarily has a negative norm.

(2) Having said that, $\epsilon_\mu^{0}$ and $\epsilon_\mu^{3}$ are not valid on shell quantities. They are a convenient mathematical fiction, needed to make an orthonormal basis, which allows things to be written in a nice, Lorentz invariant way. But the external legs of Feynman diagrams must be on shell, and as a result you can only put real honest on shell polarization vectors there, and so you aren't allowed to put $\epsilon^{(0,3)}$ there at all. Put another way, you can't satisfy the equations of motion for the photon with the longitudinal and time like modes, but the LSZ formula picks out the external wave functions that satisfy the classical equations of motion. However, since $k_\mu \mathcal{M}^\mu=0$, you could add $0$ in the funny combination $\left(\epsilon^{(0)}_\mu-\epsilon^{(3)}_\mu\right)\mathcal{M}^\mu$, which you can then add to your other basis vectors to form $\eta_{\mu\nu}\mathcal{M}^{*\mu}\mathcal{M}^\nu$ when you square to form the probablity. If the hypocricy of this angers you, that is a natural reaction, you'll eventually just accept it. (Welcome to gauge theory).

(3) EXCELLENT question. You need the off shell formulation of the Ward identity to give a real answer to this, that's in chapter 7 of P&S. Basically there's more to it than just "replace the external polarization vector by $k_\mu$", you can really show that the parts of the propagator proportional to $k_\mu k_\nu$ never matter even in loops. However, in Yang Mills theories the corresponding statement is not true! So your question is exactly on the money for Yang Mills theories, you get contributions in loops from the longitudinal and timelike modes, and by the optical theorem this taken at face value would lead to the production of unphysical particles. The fix is to add yet more unphysical particles to the theory to cancel out these parts of the loop diagrams, they are called Fadeev Popov ghosts.

After flipping through Peskin and Schroder to answer this question, I have to say that they are proving things in a very roundabout way. It's good that it teaches how to think about Feynman diagrams in a very detailed way... But there are other, less painful ways to prove and think about the Ward Identity (such as using the path integral).

Related Question