[Math] When is a vector “glued” to the origin

affine-geometrycalculusdifferential-geometryvector-spacesvectors

Let $V$ be a real finite-dimensional vector space (I guess this forces $V$ to be $\mathbb{R}^n$). My intuition is that a vector $v\in V$ must be "glued" to the origin, since the origin is the only canonical thing that $V$ has (not even the basis is canonical, and I suspect the origin (aka. the zero vector) is the only vector that has the same representation under every basis).

But, in many contexts, vectors are not thought of as being "glued" to the origin, in particular when we think of them as "displacements": a "displacement vector" from $a$ to $b$ can be the same as a displacement vector from $c$ to $d$ under suitable conditions. (Intuitively, a "displacement vector" can be moved around without making it a different vector.)

In stark contrast, a "position" in Euclidean space cannot be moved around without making it a different position, since a position is only equal to itself and no other position.

So "displacements" and "positions" can be written as vectors, but clearly they don't behave the same.

In calculus texts, authors usually switch back and forth between vectors that are "glued" to the origin, and vectors that are "not glued" to the origin. But I find that this obscures the nature of what a vector is, and I'd like a rigorous distinction.

Spivak's A Comprehensive Introduction to Differential Geometry, Volume I (Chapter 3: The tangent bundle) suggests that the appropriate language is that of tangent bundles. Namely, at each point $x\in\mathbb{R}^n$ we have a copy of $\mathbb{R}^n$: its tangent space. So, a point together with its tangent space looks like $(x,\mathbb{R}^n)$. If we let $x$ vary over $\mathbb{R}^n$, I suppose the set of all tangent spaces would be the set $\{(x,\mathbb{R}^n) \ | \ x\in\mathbb{R}^n\}$, which looks suspiciously like $\mathbb{R}^n\times\mathbb{R}^n$, ie. $\mathbb{R}^{2n}$.

Now a "position vector" seems to be a vector in the original ${\mathbb R}^n$, and a "displacement vector" starting at $x \in {\mathbb R}^n$ seems to be a vector in the tangent space $(x, {\mathbb R}^n)$. Then a vector space always has an origin, and a vector is always "glued" to that origin. What allows us to "move vectors around" with impunity is that ${\mathbb R}^n$ is isomorphic to ${\mathbb R}^n$.

Moreover, a motivation for tangent spaces seems to be precisely to formalize the idea of "displacements" on a manifold. How this is related to the idea of affine spaces (which also seem to deal with "displacements"), I don't know.


  1. Is any part of the above discussion correct?
  2. When is a vector "glued" to the origin?
  3. What is a rigorous formulation of "position vectors" and "displacements vectors"? Does it use tangent bundles? Does it use affine spaces?

Best Answer

I think it is helpful to think about these concepts on an arbitrary manifold. In that setting, there is no such thing as a position vector, because the manifold may not be a vector space; positions are specified instead by local coordinates.

The only reason we can identify positions with vectors in Euclidean space and move them around is that (1) Euclidean space is a vector space, and (2) the space carries an affine connection, which supplies a notion of parallel transport. You can't even "move around" tangent vectors on an arbitrary manifold unless you have a connection (though every Riemannian manifold comes with a natural one, the Levi-Civita connection); and in addition, as Ted says in his answer, you can't imagine those tangent vector glued "wherever you like" unless the tangent bundle is trivializable. The tangent bundle on a vector space is always trivializable.


A very useful example to keep in mind is polar coordinates in the Euclidean plane. Here a position is specified by $(r,\theta)$ or, especially in physics and engineering, a "position vector" $r\mathbf{\hat{r}}$. The former is the manifold point of view; the latter exploits the fact that we are laying curvilinear coordinates on what, at bottom, is a vector space. But you quickly learn this thing called a position vector is not really a position vector, because the vector $\mathbf{\hat{r}}$ changes from point to point. It doesn't live in the underlying space; rather, it lives in the tangent bundle, and the tangent spaces change from point to point. We are only able to think of it as a bona fide position vector because we imagine uprooting it from its tangent space and gluing it instead to the tangent space at the origin. But, to repeat, in a general manifold, the underlying space isn't necessarily a vector space and doesn't necessarily have a zero vector.

In general, to specify a position we just gives its local coordinates, and that is that: we don't need a vector to specify a position. You might wonder, then, where velocity vectors come from, because we're so used to getting velocity vectors by differentiating position vectors. But velocity vectors in arbitrary manifolds don't come from differentiating position vectors. They come from partial differentiation in directions specified by the local coordinates. More precisely, given local coordinates $(x_1, x_2)$, we represent a velocity vector as the operator $a\frac{\partial}{\partial x_1}+b\frac{\partial}{\partial x_2}$. (This is part of the reason, I think, many people struggle with the abstract definition of a tangent vector. For years they have been allowed to think of a velocity vector as the derivative of a position vector.)

You can see how the two points of view diverge when you ask a physicist or engineer to tell undergraduates how to represent velocity or acceleration vectors in planar polar coordinates. Most likely, they will draw a nice picture showing why the (time) derivative (along some curve) of $\mathbf{\hat{r}}$ is $\dot{\theta}\mathbf{\hat{\theta}}$. But the picture only works if you can parallel transport the tangent basis vectors $\mathbf{\hat{r}}$ and $\mathbf{\hat{\theta}}$ at a later point back along the curve to the tangent space at the initial point -- and it is exactly this seemingly innocent feature of the picture that requires an affine connection in general. What the calculation is really doing is exploiting the standard Euclidean connection to specify the covariant derivative / Christoffel symbols of planar polar coordinates.