Transforming non-orthonormal coordinates, inverse vs. inverse-transpose for dual-transformation

differential-geometrygeometrylinear-transformationstensors

Consider a point in cartesian and polar coordinates, $P = \{x,y\} = \{r,\theta\}$, such that $x = r \cos\theta$ and $y = r\sin\theta$, where $r$ is a radial distance that is not necessarily unity. I have an arbitrary vector (e.g. a velocity vector) at point $P$, and I know it's cartesian coordinates, $\vec{v} = a^i e_i = a^0 e_0 + a^1 e_1$, where $e_i = \{\hat{x}, \hat{y}\}$, and I want to find the components in a bases that is the polar tangent vectors, $e'_i = \{\hat{r},\hat{\theta}\}$, i.e. $\vec{v} = a^{\prime i} e'_i = a^{\prime 0} e'_0 + a^{\prime 1} e_1$.

(Ultimately I need to do this exercise in 3D spherical coordinates, and do a couple of coordinate rotations in between, but I think this case encompasses my [current] confusion).


Attempt:

I can define a transformation between coordinates based on,
$$e'_j = e_i \, {S^i}_j = e_i \frac{\partial e'_j}{\partial e_i},$$

i.e. a covariant transformation. For example, $e'_0 = \hat{r} = \frac{\partial}{\partial r} = \frac{\partial}{\partial x} \frac{\partial x}{\partial r} + \frac{\partial}{\partial y} \frac{\partial y}{\partial r}.$

Thus the tensor that transforms the bases $\{\hat{x},\hat{y}\} \rightarrow \{\hat{r},\hat{\theta}\}$, can be calculated as

$${S^i}_j = \pmatrix{\frac{\partial e'_0}{\partial e_0} & \frac{\partial e'_1}{\partial e_0} \\ \frac{\partial e'_0}{\partial e_1} & \frac{\partial e'_1}{\partial e_1}} = \pmatrix{\frac{\partial r}{\partial x} & \frac{\partial \theta}{\partial x} \\ \frac{\partial r}{\partial y} & \frac{\partial \theta}{\partial y}} = \pmatrix{\cos\theta & -\frac{1}{r} \sin\theta \\ \sin\theta & \frac{1}{r} \cos\theta},$$

and the values of $\{r,\theta\}$ here are, again, fixed based on the point $P$. Using this, we immediately get, $e'_i = \{\hat{r}, \hat{\theta} \} = \{\hat{x}\cos\theta + \hat{y}\sin\theta, \,\, – (\hat{x}/r)\sin\theta + (\hat{y}/r) \cos\theta \}$, which matches what I get from geometry. So this seems good.

The components of the vectors transform contravariantly, and so the transformation should be given by, ${T^i}_j = \partial e_j \, / \, \partial e'_i$ (NOTE: is this still true, in non-orthonormal bases?). Going through the same procedure, I find the components of the tensor that transforms the coordinates (transposes because these are 'columns') $\{a_0,a_1\}^T \rightarrow \{a'_0, a'_1\}^T$ as,

$${T^i}_j = \pmatrix{\frac{\partial e_0}{\partial e'_0} & \frac{\partial e_1}{\partial e'_0} \\ \frac{\partial e_0}{\partial e'_1} & \frac{\partial e_1}{\partial e'_1}} = \pmatrix{\frac{\partial x}{\partial r} & \frac{\partial y}{\partial r} \\ \frac{\partial x}{\partial \theta} & \frac{\partial y}{\partial \theta}} = \pmatrix{\cos\theta & \sin\theta \\ -r \sin\theta & r \cos\theta}.$$

Now if we try this, we immediately get a problem: $a^{\prime i} = \{x\cos\theta + y\sin\theta, \,\, -rx\sin\theta + ry\cos\theta \}^T$. The $0$th component seems fine, but the $1$th has the wrong dimensions: length squared instead of dimensionless! So it seems like this $T$ tensor should actually be transposed… but then it no longer seems to represent a contravariant transformation.

Am I missing a transpose somewhere? Or am I assuming an orthonormality somewhere where it doesn't exist? Rows-vs-columns, and left-vs-right matrix multiplication has always confused me, so I'm primarily following the indices and their positions, but I've still tried to be consistent with rows and columns. One "reason" to add a transpose, however, could be that while the inverse matrix should transform the components (as apposed to bases), the components are 'columns' while the bases 'rows', so perhaps that adds a transpose? But where does this come from in index notation?
Any help or pointers (even on improper terminology / syntax) is much appreciated!

Best Answer

First some differential topology of surfaces (like $\mathbb R^2$) that helps how to handle the coordinates change techniques to get how the change of components behave.

If you have two parametrization $$\Phi:\Omega\hookrightarrow\Sigma \qquad{\rm and}\qquad \Psi:\Gamma\hookrightarrow\Sigma$$ of a surface $\Sigma$, the jacobians $J\Phi, J\Psi$ of them allows to assign the tangents frames at $p\in\Sigma$ via $$J\Phi(a):\mathbb R^2\to T_p\Sigma, $$ where $a\in\Omega$, $\Phi(a)=p$ and $T_p\Sigma$ is the tangent space at $p$, so $$\partial_0=J\Phi(a)e_0\quad{\rm and}\quad \partial_1=J\Phi(a)e_1.$$ Also $$\tilde \partial_0=J\Psi(b)e_0\quad{\rm and}\quad \tilde \partial_1=J\Psi(b)e_1,$$ with the other parametrization such that $\Psi(b)=p$. Now, one can get a map $\lambda:\Omega\to\Gamma$ which complies $$ \Phi=\Psi\circ\lambda\quad{\rm and}\quad \lambda(a)=b $$ So, by the chain's rule $J\Phi=J\Psi\cdot J\lambda$ and $$J\Phi(a)=J\Psi(b)\cdot J\lambda(a),$$ and $$J\Phi(a)e_0=J\Psi(b)\cdot J\lambda(a)e_0 \quad{\rm and}\quad J\Phi(a)e_1=J\Psi(b)\cdot J\lambda(a)e_1.$$ But, if $$ J\lambda(a)=\left(\begin{array}{cc} \lambda^0{}_0&\lambda^0{}_1\\ \lambda^1{}_0&\lambda^1{}_1\end{array}\right), $$ then \begin{eqnarray*} \partial_0&=&J\Psi(b)(\lambda^0{}_0e_0+\lambda^1{}_0e_1)\\ &=&\lambda^0{}_0J\Psi(b)e_0+\lambda^1{}_0J\Psi(b)e_1\\ &=&\lambda^0{}_0\ \tilde\partial_0+\lambda^1{}_0\ \tilde\partial_1 \end{eqnarray*} and similarly $$\partial_1=\lambda^0{}_1\ \tilde\partial_0+ \lambda^1{}_1\ \tilde\partial_1.$$ For an arbitrary tangent vector in the first coordinates $\vec v=a^0\partial_0+a^1\partial_1$, the new components that would have, will be $$\vec v=(a^0\lambda^0{}_0+a^1\lambda^0{}_1)\tilde\partial_0+ (a^0\lambda^1{}_0+a^1\lambda^1{}_1)\tilde\partial_1, $$ which correspond with the operation $$ \left(\begin{array}{c}a^0\\a^1\end{array}\right) \to \left(\begin{array}{cc} \lambda^0{}_0&\lambda^0{}_1\\ \lambda^1{}_0&\lambda^1{}_1 \end{array}\right) \left(\begin{array}{c}a^0\\a^1\end{array}\right). $$

Now for your case the surface is $\mathbb R^2$, $\Phi=1\!\!1$ the identity maps and $\Psi$ is $$ \left(\begin{array}{c} r\\\theta\end{array}\right)\mapsto \left(\begin{array}{c}r\cos\theta\\r\sin\theta\end{array}\right), $$ Hence for $\lambda$ we have $$ r=\sqrt{x^2+y^2}\quad {\rm and}\quad \theta=\arctan\frac{y}{x}, $$ and its derivative is $$ \left(\begin{array}{cc}\dfrac{x}{\sqrt{x^2+y^2}} &\dfrac{y}{\sqrt{x^2+y^2}} \\ -\dfrac{y}{\sqrt{x^2+y^2}}& \dfrac{x}{\sqrt{x^2+y^2}}\end{array}\right), $$ which happens to be the inverse of the derivative $J\Psi$ but expressed in orthonormal coordinates.

Then the base change with respect to this is $$e_0=\frac{x}{\sqrt{x^2+y^2}}\ e'_0-\frac{y}{\sqrt{x^2+y^2}}\ e'_1 \quad {\rm and}\quad e_1=\dfrac{y}{\sqrt{x^2+y^2}}\ e'_0+\dfrac{x}{\sqrt{x^2+y^2}}\ e'_1, $$ or $$e_0=\cos\theta\ e'_0-\sin\theta\ e'_1 \quad {\rm and}\quad e_1=\sin\theta\ e'_0+\cos\theta\ e'_1, $$ in polar terms.

Now insert those into your linear combination $\vec v=a^0e_0+a^1e_1$ to get $$\vec v=(a^0\cos\theta+a^1\sin\theta)e'_0 +(-a^0\sin\theta+a^1\cos\theta)e'_1. $$ You could see how the multiplication $$\left(\begin{array}{cc} \cos\theta&\sin\theta\\ -\sin\theta&\cos\theta\end{array}\right) \left(\begin{array}{c}a^0\\a^1\end{array}\right), $$ matches how the new components of $\vec v$ are get.

Related Question