It is often Stated that Bell's Theorem is equivalent to the statement: No theory of Local Hidden Variables can reproduce all of the predictions of quantum mechanics. I see nowhere in Bell's Theorem the assumption of hidden variables. As far as I can tell, there is one assumption: locality, which is shown to conflict with quantum mechanics. If I am wrong on this, could someone demonstrate exactly how Bell failed to demonstrate that quantum mechanics is non-local by showing how the assumption of hidden variables is used in his theorem? To me — and by the way Bell — the problem is locality.
[Physics] Assumptions in Bell’s Theorem
bells-inequalitylocalityquantum mechanicsquantum-interpretations
Related Solutions
Bell's theorems indeed rule out simple theories where hidden variables obey local equations. However, no matter how you reason, it's always at some point where you need another assumption. In its simplest form, it is the assumption that two observers, Bob an Alice, have the "free will" to choose along which axis they will measure the spin of a particle (photon, electron, or something else). Well, one could object that in a deterministic theory they have no such free will; their decisions were made in the far past.
But that does not invalidate Bell, because now you can say: Bell's theorem would imply that entangled photons emitted by a physical source are correlated non locally in an unnatural way with the nerves in Bob's and Alice's brains long before they made their decisions. That's called "conspiracy". So now the assumption is: there can't be conspiracy. Can't there? Spacelike non-local correlations in physical states are common in the physical world. In fact, in quantum field theory it's the propagators of all physical particles that describe correlations, and they do not vanish far outside the light cone. But the kind of conspiracy quantum systems seem to display (when described in terms of "hidden variables") looks disgusting to many researchers. So it is usually dismissed. Is "disgusting" a sound mathematical argument? You decide ...
Bell's argument actually fails in a deterministic universe. The argument's fundamental assumption is that the outcome of a measurement on one particle can't depend on the measurement basis chosen by the other experimenter for the other particle at a spacelike separated location. This is usually described as a locality assumption, but there's more to it than that. If the experimenter's choice is actually an inevitable consequence of the state of the universe at earlier times, then it's possible that it's an inevitable consequence of the state at the place and time that the entangled pair of particles was created. If so, the particles could decide then and there how to respond to the measurements that inevitably will happen later, reproducing the prediction of quantum mechanics without any spooky long-distance communication.
It's hard to imagine how elementary particles could know in advance the result of a physical process as complicated as the choice of a measurement basis by an experimenter, nor how that choice could be predictable solely from information localized in a small part of its past light cone, and as far as I know no one has actually proposed a local hidden variable theory in which this happens. But Bell didn't prove it impossible.
So if Jaynes is taking the hardline position that the universe is deterministic and all probabilities are inherently about our lack of knowledge of its state, I think he's at least morally correct, and he may be literally correct in his entire argument. He's right that Bell's argument assumes a Y-causes-X relationship in the probabilities P(X|Y), and he's right that there is no such relationship in subjective probabilities. You need some kind of objective unpredictability in the world for Bell's argument to go through – not quantum unpredictability, but some sort of free choice or true classical randomness at the spacelike separated locations of the measurements.
To the extent that Jaynes acts like the mystery of Bell's theorem is totally resolved just because he can model the quantum prediction in his classical probabilistic logic, I think he's wrong. A deterministic model where particles know the future is not logically impossible, but it would be really frickin' weird, and I don't expect any such model to appear. (Although, quantum mechanics is also weird, and I never would have predicted it, so I suppose I should expect to be surprised.)
Best Answer
The issue is locality of what? Which quantities are assumed to be local?
If you say the reults of all experiments, hypothetical as well as actual, and say that these must be assignable definite values, then this is in conflict with quantum mechanics. But this is the assumption that is called "hidden variables", the reason is that this is the assumption Einstein made in the EPR paper. This is the assumption that if you could perform an experiment to determine a quantity, then it is legitimate to give this quantity a definite value, even if you didn't do the experiment. The hidden variables are just the names of the extra quantities that determine the outcome of those hypothetical experiments.
The results of the quantum experiments themselves are only obviously nonlocal if their results are assumed to be definite things, even in those cases where you didn't actually go on to perform the experiment. This is not accepted in the standard interpretations, although for slightly different reasons in different philosophies.
Suppose you have three polarization settings, A,B,C such that the measurement of A,B,C is always the same between the two distant spins, A and B are 99% correlated, B and C are 99% correlated, and A and C are 96% correlated. This is the starkest violation of Bell's inequality that I know. In this case, you can see the violation without a calculation--- if A and B are the same except for 1 time in a hundred, and B and C are the same except for one time in a hundred, how can A and C be different more than 2 times in a hundred? Obviously they can't, and that's Bell's inequality. But obviously they are, in quantum mechanics, because the probability difference between polarization settings goes as the square of 1 minus the (small) angle squared, so that doubling the (small) angle increases the discrepancy in the measurements by a factor of 4, not by a factor of 2.
In order to argue that this is a locality violation, when you are measuring A on particle 1 and B on particle 2, you have to assume that were you to measure A on particle 2, the result would have been the same. The only locality violations are regarding the answers to these hypothetical questions. But you aren't measuring A on particle 2, so that this is a counterfactual statement. You don't have to believe that counterfactual statements of this sort are meaningful. If you believe that the results of experiments come out of thin air, that they don't come from the laws of physics, but of an irreducible interaction between the particles and the measuring device, which is called an observation.
This is Bohr's language, so I am just stating Bohr's position that the results of experiments come out of thin air. They aren't determined by anything previous. So it makes no sense to say "were you to measure A, the result would have been such and so", because you aren't measuring A. The assumption of "counterfactual definiteness" is the central thing, that it makes sense to talk about the answers to experiments you haven't done. I don't like Bohr's way of stating it, I prefer Everett's, but the two answers are exactly the same when push comes to shove.
The assumption of hidden variables is only used to allow you to talk about what the outcome of the A measurement on particle 1 would have been for those instances where you are doing a B measurement on particle 1. You can determine what they answer would have been using the hidden variables, assuming that the hidden variables determine the outcome of the experiments. You don't have to use the hidden variables if you are willing to assume that it is ok to talk about the value you would have gotten in experiments you haven't done.
Since it is only through considering couterfactual experiments that you get a contradiction, if you don't have counterfactuals, you don't have contradictions. The contradiction is that there is no way to assign consistent definite results to all the possible counterfactuals. You could just refuse to assign answers to hypothetical measurements. Equivalently, you could assign amplitudes with those answers, not definite results. Its only when you try to assign definite results that you get a contradiction.
This argument of Bohr's is very difficult to understand, and, like everything else, it is entirely cleared up by switching to Everett's many-worlds philosophy. This explains exactly what is going on, in a detailed mechanistic way that is equivalent to Bohr except for philosophical mumbo-jumbo. Since it is equivalent to Bohr, modulo philosophy, I don't see any point in retaining Bohr's harder to understand language, except to the extent that it is interesting that you can state things this way too.
In the many-worlds point of view, the A measurement splits the observer, and it splits the observer in a different direction than the B measurement does, it entangles the observer with a different property of the particle. The relative statistics that the two observers see are only determined when the two observers come to talk, and then which copy of the observer over here meets which copy of the other observer over there is always a matter of how they are tilted relative to each other in wavefunction space. The tilt between the copies of the observers is entirely determined by which quantity they chose to measure. It is now obvious that there is no reason that Bell's inequality has to hold in this sort of thing, because the results not only of counterfactual experiments are undertermined, but even the results of actual experiments are undetermined! There is still another copy of you which got the opposite results!
Bohr's philosophy is obtained by focusing on one branch, and rejecting the other counterfactual branches as non-existing.
I suppose you could see even the Everett version as violating locality, because the world-label (and the wavefunction) are global constructs. But I don't think this is the best practice. If the world is quantum mechanical, those experiments which split observers will always be indefinite.
Semi-local classical illustration
Since many-worlds requires some slightly nonobvious philosophy, it is best to make the argument in a purely classical universe, where there are no philosophical headaches. You can violate Bell's inequality inside a semi-local classical theory, with world-splitting. The theory will be a classical analog to the Many-Worlds quantum theory, but no philosophical headaches, because the splitting will be built into the theory, and not emerge from philosophical contortions.
The theory will be local in that it will obey the principle:
It will not be local in this sense:
These variables describe the analog of the entanglement between A and B.
Assume a classical Newtonian world, particles interacting by a retarded potential (so that you have relativistic locality), but with a secret world integer W. The world variable is just a stupid parameter that tells you which world you are on.
Two particles only interact with each other when they have the exact same value of the variable "W" (so that W is not a position, exactly, but a sheet-number), and initially, there is one copy of the particles on world 0.
The particles can, in addition to the Newtonian stuff, also do a "fork", which adds a new world to the list of all existing worlds (a new integer label), and copies all the particles nonlocally at their current positions into the new world, and the particle which is forked is altered in a subtle way--- it has a coupling to a forking-detector material (this can all be done explicitly in a computer model).
The copying of particles to the new world doesn't have to happen right away--- since the new world is identical to one of the old worlds, you can use the old world variables right up to the moment of first altered interaction, which propagates outward from the fork position at the speed of light.
Further, the forks have internal variables, the phases. Any two forks have a relative phase, which is a variable on the unit circle. When you fork A and fork B at angle $\theta$, you make worlds in the proportion: $cos^2(\theta)$ AB's and A'B' , $sin^2(\theta)$ AB' and A'B's. In the preceding, A and A' are the two splits descended from A, and B and B' are the two splits descended from B, and the relative angle only tells you how correlated the two splits are with each other, by arranging more copies at the point of collision (when an A split meets a B split) in the proportion determined by their relative angle.
This world-splitting classical model reproduces the quantum situation. This is not a surprise, because it is just made up to do so.
The point of this is that the quantum mechanics is only as nonlocal as this classical model. It is not clear whether the splitting and the angles are to be considered nonlocal, since they are only nonlocal in the sense of extra data, not nonlocal in the sense of influence. All the influences travel strictly at less than the speed of light.