[Math] Help understanding the range and kernel of a linear transformation

linear algebralinear-transformationsvector-spaces

I'm having some trouble understanding the Range and Kernel of a linear transformation. The definition goes as follows:

Let $T:V \longrightarrow W$ be a linear transformation. Define the sets $ker(T) = \{v\in V : T(v) =0\}$, $R(T) =\{w\in W : w=T(v)\; for\; some \; v\in V\}$. $ker(T)$ is the kernel and $R(T)$ is the range.

Then this theorem follows:

If $T:V \longrightarrow W$ is linear, then $ker(T)$ is a subspace of V and $R(T)$ is a subspace of $W$.

I'm not really getting it. Could someone please give a simple, yet detailed, explanation along with an example of what the Kernel and Range is and what the theorem that follows wants to say?

Thanks.

Best Answer

The definitions are there to highlight sets that are important to understanding the properties of the linear transformation T. Since $T:V\rightarrow W$ the kernel of T is every element of $V$ that T transforms into $0$. The range of T is every element of $W$ that is a transformation of an element of $V$. So, some simple examples:

  1. Let $T:\mathbb{R} \rightarrow \mathbb{R}$ be given by $T(x)=x$. Then Ker$(T)$ = $\{0\}$ (no other element of $\mathbb{R}$ is zero and T is the identity map) and Range$(T)$ is $\mathbb{R}$ because every element of $\mathbb{R}$ is used up by T.

  2. Let $T:\mathbb{R}^2 \rightarrow \mathbb{R}^2$ be given by $T(x,y) = (x+y, x-y)$. The kernel here is all elements of $\mathbb{R}^2$ that map to $(0,0)$ under T. This means solving the simultaneous equations $x+y=0$ and $x-y=0$ and you can see that $(0,0)$ is the only solution. So Ker$(T)=\{(0,0)\}$. Range$(T)$ is $\mathbb{R}^2$ again, because if you pick any target point $(\alpha, \beta)$ and solve the simultaneous equations $x+y=\alpha$ and $x-y=\beta$ then you find $x=(1/2)(\alpha+\beta)$ and $y={1/2}(\beta-\alpha)$ , i.e. there is a value (x,y) that T turns into $(\alpha, \beta)$.

What is the theorem telling you? It's telling you that these sets have structure; they're not just random collections of points. In both the above examples the Kernel consists of the origin, and so is a 0-dimensional subspace. If we had an example where the kernel was bigger, it would have to have at least 1 dimension (subspaces have integer dimensions), so it would be a line (or plane, or hyperplane as the number of dimensions increase). In other words, all the elements that T map to zero are related to each other: you can find the line that T maps to zero (that's what the kernel gives you).

As an example here, consider $T:\mathbb{R}^2 \rightarrow \mathbb{R}^2$ given by $T(x,y) = (x-y, 0)$. The kernel of T is now $\{(x,y) \in \mathbb{R}^2 : x=y\}$. This is a line in $\mathbb{R}^2$, and T maps any point on it to $(0,0)$.

The range also has structure in the same way (but you expect this because T has structure and T defines the range).

Note also that if the kernel of a linear transformation is just the zero element then the transformation must be injective (one-to-one), which is often very useful to know.

Related Question