The form of $B_{ij}$ in $U\sigma_i U^\dagger=B_{ij}\sigma_j$, with $\sigma_j$ Pauli matrices

hermitian-matriceslie-algebraslinear algebramatrices

Let $\sigma_i$ denote the Pauli matrices:
$$
\sigma_1\equiv \begin{pmatrix}0&1\\1&0\end{pmatrix}, \quad
\sigma_2\equiv \begin{pmatrix}0&-i\\i&0\end{pmatrix}, \quad
\sigma_3\equiv \begin{pmatrix}1&0\\0&-1\end{pmatrix}.
$$

It isn't hard to see that any $2\times 2$ unitary $U$ can be written in terms of these matrices as
$$ U = c_0 I + \sum_{k=1}^3 ic_k \sigma_k, $$
for some real coefficients $c_j$ normalised to one: $\mathbf c\equiv(c_0,c_1,c_2,c_3)\in S^3$.

It turns out to be the case that
$$ U\sigma_i U^\dagger = \sum_{j=1}^3 B_{ij} \sigma_j, \tag A$$
for any $i\in\{1,2,3\}$, with $B$ a unitary matrix.
I can see why this must be the case by direct analysis on $U\sigma_i U^\dagger$: expanding $U$ in terms of Pauli matrices and using the known expressions for products of Pauli matrices to get to a final expression for $B_{ij}$. My problem with this is that it's a somewhat tedious procedure, and the final expression doesn't make it particularly obvious that $B$ is always unitary.

I am looking for a better way to prove (A), especially because the expression seems to lend itself to be understood on more abstract grounds (I don't know much about Lie theory, but it seems to be saying something on the lines of $U(2)$ acting on its Lie algebra unitarily via the adjoint representation… if that makes sense).

Best Answer

We define an inner product over $\Bbb C^{n \times n}$ by $\frac 1n \langle A,B \rangle = \operatorname{tr}(A^\dagger B)$; this is (a normalized version of what is) known as the "Frobenius" or "Hilbert-Schmidt" inner-product.

Note that for any $U$, the matrices $U\sigma_j U^\dagger$ form an orthonormal basis for the space of trace free $2 \times 2$ matrices (if you like, the orthogonal complement of the span of $I$). That is, we have $$ \langle U\sigma_jU^\dagger,U\sigma_kU^\dagger \rangle = \delta_{jk} $$ where $\delta_{jk}$ is a Kronecker-delta, and every trace-zero matrix can be written as a linear combination of these matrices.

The matrix $B_{ij}$ that you describe is the change-of-basis matrix that takes us from a coordinate-vector relative to the basis $\{U\sigma_jU^\dagger: j =1,2,3\}$ to a coordinate vector relative to the basis $\{\sigma_j: j = 1,2,3\}$. Because where are changing between two orthonormal bases, the resulting change-of-basis matrix is unitary.

Related Question