[Physics] What do the Pauli matrices mean

angular momentumlie-algebraquantum mechanicsquantum-spinspinors

All the introductions I've found to Pauli matrices so far simply state them and then start using them. Accompanying descriptions of their meaning seem frustratingly incomplete; I, at least, can't understand Pauli matrices after reading them at all.

My current understanding and confusion is demonstrated below. I'd be ever so grateful if someone could fill in all the holes, or poke new ones where appropriate.

Spinors looks like column vectors, i.e.
$$s = \left(\begin{matrix}1\\0\\1\end{matrix}\right)$$
and are used so that rotation in three dimensions (using complex numbers) can be linearly transformed. What does the example spinor above mean? A spin value of 1 in the x and z directions? How can spin-$\frac{1}{2}$ be represented with just 1s then?

A three dimensional vector is used to construct the Pauli matrix for each dimension. E.g., for spin-$\frac{1}{2}$, the vectors used for x, y and z are $v_x =(1,0,0)$, $v_y=(0,1,0)$ and $v_z=(0,0,1)$. You transform them each to the relevant Pauli matrix by the following equation, using dimension x for demonstration,
$$
P^x=\left(\begin{matrix}
v_3^x&v_1^x – i v_2^x\\
v_1^x+i v_2^x&-v_3^x
\end{matrix}\right)
$$
where superscript denotes dimension, not power.

Once you have these matrices, you operate on the spinors with them. What does this do?

You can also find the eigenvalues and eigenvectors for the matrix, which can be used to find the probability that a particle, if measured to have a certain spin in one dimension, when measured next will have spin in another dimension that you choose. I don't understand how this works. What does the eigenvalue and eigenvector in this sense physically represent, and how does spin up and down fit into this? E.g. If you had a spin-1 particle that you knew was spin up in the x direction, what would you do to find the probability of it having spin up or down in the z or y dimension when next measured?

Concrete examples would probably help my understanding a lot.

Best Answer

Let me first remind you of (or perhaps introduce you to) a couple of aspects of quantum mechanics in general as a model for physical systems. It seems to me that many of your questions can be answered with a better understanding of these general aspects followed by an appeal to how spin systems emerge as a special case.

General remarks about quantum states and measurement.

The state of a quantum system is modeled as a unit-length element $|\psi\rangle$ of a complex Hilbert space $\mathcal H$, a special kind of vector space with an inner product. Every observable quantity (like momentum or spin) associated with such a system whose value one might want to measure is represented by a self-adjoint operator $O$ on that space. If one builds a device to measure such an observable, and if one uses that device to make a measurement of that observable on the system, then the machine will output an eigenvalue $\lambda$ of that observable. Moreover, if the system is in a state $|\psi\rangle$, then the probability that the result of measuring that quantity will be the eigenvalue of the observable is \begin{align} p(\lambda) = |\langle \lambda|\psi\rangle|^2 \end{align} where $|\lambda\rangle$ is the normalized eigenvector corresponding to the eigenvalue $\lambda$.

Specialization to spin systems.

Suppose, now, that the system we are considering consists of the spin of a particle. The Hilbert space that models the spin state of a system with spin $s$ is a $2s+1$ dimensional Hilbert space. Elements of this vector space are often called "spinors," but don't let this distract you, they are just like any other vector in a Hilbert space whose job it is to model the quantum state of the system.

The primary observables whose measurement one usually discusses for spin systems are the cartesian components of the spin of the system. In other words, there are three self-adjoint operators conventionally called $S_x, S_y, S_z$ whose eigenvalues are the possible values one might get if one measures one of these components of the system's spin. The spectrum (set of eigenvalues) of each of these operators is the same. For a system of spin $s$, each of their spectra consists of the following values: \begin{align} \sigma(S_i) = \{m_i\hbar\,|\, m_i=-s,-s+1,\dots, s-1,s\} \end{align} where in my notation $i=x,y,z$. So for example, if you build a machine to measure the $z$ component of the spin of a spin-$1$ system, then the machine will yield one of the values in the set $\{-\hbar, 0, \hbar\}$ every time. Corresponding to each of these eigenvalues, each spin component operator has a normalized eigenvector $|S_i, m_i\rangle$. As indicated by the general remarks above, if the state of the system is $|\psi\rangle$, and one wants to know the probability that the measurement of the spin component $S_i$ will yield a certain value $m_i\hbar$, then one simply computes \begin{align} |\langle S_i, m_i |\psi\rangle|^2. \end{align} For example, if the system has spin-$1$, and if one wants to know the probability that a measurement of $S_y$ will yield the eigenvalue $-\hbar$, then one computes \begin{align} |\langle S_y, -1|\psi\rangle|^2 \end{align}

Spinors.

In the above context, spinors are simply the matrix representations of states of a particular spin system in a certain ordered basis, and the Pauli spin matrices are, up to a normalization, the matrix representations of the spin component operators in that basis specifically for a system with spin-$1/2$. Matrix representations often facilitate computation and conceptual understanding which is why we use them.

More explicitly, suppose that one considers a spin-$1/2$ system, and one chooses to represent states and observables in the basis $B =(|S_z, -1/2\rangle, |S_z, 1/2\rangle)$ consisting of the normalized eigenvectors of the $z$ component of spin, then one would find the following matrix representations in that basis \begin{align} [S_x]_B &= \frac{\hbar}{2}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} = \frac{\hbar}{2}\sigma_x\\ [S_y]_B &= \frac{\hbar}{2}\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} = \frac{\hbar}{2}\sigma_y\\ [S_z]_B &= \frac{\hbar}{2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} =\frac{\hbar}{2}\sigma_z\\ \end{align} Notice that these representations are precisely the Pauli matrices up to the extra $\hbar/2$ factor. Moreover, each state of the system would be represented by a $2\times 1$ matrix, or "spinor" \begin{align} [|\psi\rangle]_B = \begin{pmatrix} a \\ b\end{pmatrix}. \end{align} And one could use these representations to carry out the computations referred to above.

Related Question