Why is the determinant of a matrix representing a quaternion related to its norm

clifford-algebrasdeterminantnormed-spacesquaternionsrepresentation-theory

A common (complex $2\times 2$) matrix representation of the quaternions is given by $\phi(a+bi+cj+dk)=aI+bi\sigma_1+ci\sigma_2+di\sigma_3$, where $\sigma_i$ are the Pauli matrices, as shown here. In this representation, we have $\det\phi(x)=|x|^2.$ The same relation holds for a standard matrix representation of $\mathbb C$ as real $2\times 2$ matrices.

The proof of these statements is easy, but what's the secret behind these facts? Is there a geometric viewpoint that makes this obvious, or an abstract coordinate-free proof that it must hold for matrix representations satisfying certain conditions, or a generalization to representations of other Clifford algebras?

Best Answer

The complex case is easier so let's start there first. We associate to a complex number $z = a + bi$ the $2 \times 2$ real matrix

$$M(z) = \left[ \begin{array}{cc} a & b \\ -b & a \end{array} \right];$$

abstractly this corresponds to considering the action of $\mathbb{C}$ on itself by left multiplication. The determinant of this matrix is the square norm $a^2 + b^2$ and we'd like a more conceptual explanation of this. The determinant is the product of the eigenvalues of $M(z)$, so what are these eigenvalues? They are exactly the complex number $z$ and its conjugate $\bar{z}$, where the corresponding eigenvectors are $\left[ \begin{array}{c} 1 \\ i \end{array} \right]$ and $\left[ \begin{array}{c} 1 \\ -i \end{array} \right]$.

In turn, one way to see why these must be the eigenvalues is to consider the characteristic polynomial of $M(z)$. This has to be a real quadratic polynomial satisfied by $M(z)$, and hence satisfied by $z$. But clearly if $z$ is not real then its minimal polynomial over $\mathbb{R}$ is

$$(t - a)^2 + b^2 = (t - z)(t - \bar{z}) = 0$$

which is exactly the real polynomial with roots $z, \bar{z}$ and so, by comparing degrees, must be the characteristic polynomial. Since the non-real $z$ are Zariski dense this must be the characteristic polynomial for all $z$. So the determinant is the squared norm $\det M(z) = z \bar{z}$ as desired.


The quaternionic case is similar. We associate to a quaternion $q = a + bi + cj + zk$ a $2 \times 2$ complex matrix $M(q)$ which I don't want to write out explicitly but which abstractly comes from considering the action of $\mathbb{H}$ on itself by left multiplication, together with the complex structure given by right multiplication by any copy of $\mathbb{C}$ inside $\mathbb{H}$, say the copy $\{ a + bi \}$ for concreteness. Again the determinant $\det M(q)$ must be the product of the eigenvalues of $M(q)$. So, what are these eigenvalues?

Again let's consider the characteristic polynomial of $M(q)$. This has to be a complex quadratic polynomial satisfied by $M(q)$, and hence satisfied by $q$ (again, we need to fix a copy of $\mathbb{C}$ inside $\mathbb{H}$, and we are taking $\{ a + bi \}$). But again if $q$ is not real then its minimal polynomial over $\mathbb{R}$ is

$$(t - a)^2 + b^2 + c^2 + d^2 = (t - q)(t - \bar{q}) = 0$$

and if $q$ is neither real nor complex then this must also be its minimal polynomial over $\mathbb{C}$, so must be the characteristic polynomial of $M(q)$. And again since the non-complex $q$ are Zariski dense this must be the characteristic polynomial for all $q$. So the determinant is the squared norm $\det M(q) = q \bar{q}$ again as desired.


Note that ultimately in the quaternionic argument we work with the minimal polynomial over $\mathbb{R}$ and end up not having to talk about $\mathbb{C}$ after all, and in both arguments we ultimately end up not really having to discuss the matrix representations at all. This whole situation is clarified enormously by knowing that there is an abstract definition of the characteristic polynomial of an element of a finite-dimensional algebra (it is "the generic minimal polynomial" in a sense which can be made precise), which generalizes the familiar case of matrix algebras and which does not depend on a choice of embedding of the algebra into a matrix algebra, and which reproduces the characteristic polynomials above. The only exposition I know of it anywhere is in Skip Garibaldi's The characteristic polynomial and determinant are not ad hoc constructions, which is well worth a read.

Related Question