The benefit of defining a positive norm for vectors

inner-productslinear algebranormed-spaces

I read that the reason we have the property $\langle A|B\rangle=\langle B|A\rangle^*$ is to make define a positive norm with the formula $\langle A|A\rangle$.

But I do not understand how having this norm benefits us. I guess we're doing this to make an analogy with arrows, which also have a positive norm.

But this can't be the only reason. After all, a lot of things which are true for arrows are not true for general vectors. For instance, angle values of $-2\pi$ to $2\pi$ are not carried over from arrows to general vectors. The formula for the angle between general vectors, $\cos \theta=\frac{\langle A|B\rangle}{|A||B|}$, can result in complex values of $\theta$. The commutativity of the inner product isn't carried over from arrows to general vectors either (though this is the very reason a positive norm gets carried over).

Keeping a positive norm for general vectors must be allowing us to carry over some nice properties from the world of arrows to general vectors. What are those nice things?

Like, even if we drop this property, we'd still be able to prove the existence of an orthonormal basis, as Gram Schmidt does not require $\langle A|B\rangle=\langle B|A\rangle ^*$. So at least that stuff still works out.

EDIT- I just realised that, while Gram Schmidt may not require $\langle V|V\rangle$ to be strictly positive, it does require $\langle V|V\rangle$ not to be 0 for non-zero vectors $|V\rangle$, because only then can we rescale the basis vectors by their norm to get a unit vector.

EDIT- I also realised that the Cauchy Schwarz and Triangle Inequalities would no longer make sense without this norm. Maybe these are useful results too.

Best Answer

$\newcommand{\Cpx}{\mathbf{C}}$Context suggests you're asking about inner products in a (finite-dimensional) complex vector space and asking why the usual definition imposes conjugate symmetry rather than the symmetry imposed for real inner products.

Let's look at the situation in $\Cpx$, writing $A = x + iy$ and $B = x' + iy'$ with $x$, $y$, $x'$, and $y'$ real. The "symmetric" (or complex-bilinear) definition of the inner product is $$ AB = (x + iy)(x' + iy') = (xx' - yy') + i(xy' + x'y). $$ By contrast, the "Hermitian" (or "conjugate-linear") definition is $$ A^{*}B = (x - iy)(x' + iy') = (xx' + yy') + i(xy' - x'y). $$ Suppose we're looking for a generalization of the real inner product. Which should we pick?

For the first, neither component (real or imaginary part) is the Euclidean dot product. For the second, the real part is the Euclidean dot product under the obvious identification of $\Cpx$ with the real plane. Score one for conjugate-linearity.

There's a fringe benefit to the second: The imaginary part is another friend of ours, the determinant or area form in the plane.

These observations generalize for complex vectors with $n > 1$ components, i.e., to the complex vector space $(\Cpx^{n}, +, \cdot)$: The real part of the conjugate-linear inner product is the Euclidean dot product on the real vector space $\Cpx^{n}$, and for free we pick up interesting extra structure (a skew-symmetric real-bilinear function) in the imaginary part.

Since an arbitrary finite-dimensional complex vector space is isomorphic to a complex Cartesian space, it's natural to adopt conjugate-linearity when we define an inner product on a vector space with complex scalar multiplication.

Added: In addressing why we prefer Hermitian symmetry for complex inner products, I may have missed the motivation about "What do we lose if an inner product is indefinite (but non-degenerate)?"

The edits to the question mention linear-algebraic properties we lose with an indefinite inner product. Another is "homogeneity of direction": In a Euclidean space (positive-definite inner product), for each pair of lines through the origin there exists a linear isometry carrying one line to the other.

When we have an indefinite inner product, this is no longer the case; we have homogeneity on the sets of "positive" lines, of "null" lines, and of "negative" lines. That in turn means some lines through the origin do not hit the unit sphere.

This is getting a little far afield, but from my perspective as a geometer, the great "loss" in dropping positivity is not linear-algebraic but topological: In an indefinite inner product space we lose compactness of the unit sphere.

Here's a basic example of the resulting complications: Suppose $(M, g)$ is a compact manifold equipped with a Riemannian metric (positive-definite at each point). At each point, the sphere of unit vectors is compact. Consequently (because local product structure), the unit sphere bundle is compact. Now, unit-speed geodesics of $(M, g)$ may be viewed as integral curves of a unit vector field on the unit sphere bundle. Compactness of the unit sphere bundle ensures completeness of the flow: On a compact Riemannian manifold, geodesics exist for all time.

On a compact indefinite (pseudo-Riemannian) manifold, however, geodesic flow can be incomplete. Einstein Manifolds by A. Besse contains an example of a Lorentz-signature metric on an ordinary $2$-torus with this property. If the light cones (null directions of the metric) tilt suitably from point to point, there is a unit-speed timelike geodesic that in finite time winds infinitely many times around the torus.

This isn't to say mathematicians can't work with indefinite metrics, just that doing so is technically more challenging, and some properties of positive inner products do not generalize to indefinite non-degenerate inner products.

Best Answer

Related Solutions

[Math] What matrices preserve the $L_1$ norm for positive, unit norm vectors

[Math] The shortest distance problem by using Gram-Schmidt Process

Related Question