[Math] Why can a system of linear equations be represented as a linear combination of vectors

linear algebra

I was watching Gilbert Strang's first Linear Algebra lecture, and the very first thing he does is relating the standard view of a system of linear equations as lines -in $\mathbb{R}^2$ of course- (what he calls the row picture) with the notion of taking a linear combination of the vectors given by the columns of the matrix (the column picture).

Now, I can see it works in practice to reach the same solution, but I don't intuitively understand why this is the case. A priori, they seem like very different things, and it's mysterious that these two views somehow correspond to each other.

Could anyone shed some light on this? I'd greatly appreciate it!

Best Answer

If you are wondering why the linear system $$ 2x-y=1,\\x+y=5\tag{1} $$ and the so called column form (by Strang) $$ x\begin{bmatrix} 2\\1 \end{bmatrix} +y\begin{bmatrix} -1\\1 \end{bmatrix}= \begin{bmatrix} 1\\5 \end{bmatrix}\tag{2} $$

above are the same, then the short answer would be

it is by "definition" so. You will learn the "definition" later to see exactly what (2) means, which will tell you why (1) and (2) are the same thing. Here is a list for what you might want to pay attention to in later lectures:

  • What is a vector $[2,1]^T?$
  • What does it mean by multiplying a real number $x$ to the vector $[2,1]^T$?
  • What does it mean by adding two vectors together?
  • When are two vectors the same?

Here is Strang's own explanation in his textbook: Consider the example:

enter image description here enter image description here enter image description here enter image description here


Related Question