[Math] Why is an orthogonal matrix called orthogonal

linear algebramatricesorthonormalterminology

I know a square matrix is called orthogonal if its rows (and columns) are pairwise orthonormal

But is there a deeper reason for this, or is it only an historical reason? I find it is very confusing and the term would let me assume, that a matrix is called orthogonal if its rows (and columns) are orthogonal and that it is called orthonormal if its rows (and columns) are orthonormal but apparently that's not conventional.

I know that square matrices with orthogonal columns have no special interest, but thats not the point. If I read the term orthogonal matrix my first assumption is, that its rows (and columns) are orthogonal what is correct of course, but the more important property is that they are also orthonormal


So, Question:
Why do you call an orthogonal matrix orthogonal and not orthonormal?
Wouldn't this be more precisely and clearly?

Best Answer

A affine transformation which preserves the dot-product on $\mathbb{R}^n$ is called an isometry of Euclidean $n$-space. In fact, one can begin without the assumption of an affine map and derive it as a necessary consequence of dot-product preservation. See the Mazur Ulam Theorem which shows this result holds for maps between finite dimensional normed spaces where the notion of isometry is that the map preserves the norm. In particular, an isometry of $\mathbb{R}^n$ can be expressed as $T(v)=Rv+b$ where $R^TR=I$. The significance of such a transformation is that it provides rigid motions of Euclidean $n$-space. Two objects are congruent in the sense of highschool geometry if and only if some rigid motion carries one object to the other.

My Point? this is the context from which orthogonal matrices gain their name. They correspond to orthogonal transformations. Of course, these break into reflections and rotations according to $\text{det}(R)= -1,1$ respective.

Likely thought: we should just call such transformations orthonormal transformations. I suppose that would be a choice of vernacular. However, we don't, so... they're not orthonormal matrices. But, I totally agree, this is just a choice of terminologies. Here's another: since $R^TR=I$ implies $R$ is either a rotation or reflection let's call the set of such matrices rotoreflective matrices. In any event, I would advocate the change of terminology you advocate, but, the current terminology is pretty well set at this point, so, good luck if you wish to change this culture.

Related Question