Linear Algebra – What Does it Mean for Two Matrices to be Orthogonal?

linear algebraorthogonality

Firstly, please bear in mind that I do not have much knowledge about linear algebra. Secondly, I am not asking about some mathematical definitions but rather a more physical definition of this mathematical meaning.

So, the problem is that I can understand the meaning of orthogonality between two vectors, they are just "lines" perpendicular to each other, but I can not physically perceive what orthogonality means for matrices.(like a $3\times 3$ one).
I mean if vectors are like lines in space, those being orthogonal is an easy concept to visualize. But how to visualize two orthogonal matrices?

I think that this is easier to explain to me if you first explain how does a matrix look like in space (visualization):

For example, for a $2 \times 2$, is it the area that its two contained vectors form in space? If it is that, then two orthogonal matrices are simply those two areas that are perpendicular to each other. If not, give me an analogous explanation.*

Best Answer

There are two possibilities here:

  1. There's the concept of an orthogonal matrix. Note that this is about a single matrix, not about two matrices. An orthogonal matrix is a real matrix that describes a transformation that leaves scalar products of vectors unchanged. The term "orthogonal matrix" probably comes from the fact that such a transformation preserves orthogonality of vectors (but note that this property does not completely define the orthogonal transformations; you additionally need that the length is not changed either; that is, an orthonormal basis is mapped to another orthonormal basis). Another reason for the name might be that the columns of an orthogonal matrix form an orthonormal basis of the vector space, and so do the rows; this fact is actually encoded in the defining relation $A^TA = AA^T = I$ where $A^T$ is the transpose of the matrix (exchange of rows and columns) and $I$ is the identity matrix.

    Usually if one speaks about orthogonal matrices, this is what is meant.

  2. One can indeed consider matrices as vectors; an $n\times n$ matrix is then just a vector in an $n^2$-dimensional vector space. In such a vector space, one can then define a scalar product just as in any other vector space. It turns out that for real matrices, the standard scalar product can be expressed in the simple form $$\langle A,B\rangle = \operatorname{tr}(AB^T)$$ and thus you can also define two matrices as orthogonal to each other when $\langle A,B\rangle = 0$, just as with any other vector space.

    To imagine this, you simply forget that the matrices are matrices, and just consider all matrix entries as components of a vector. The two vectors then are orthogonal in the usual sense.