Relearning Matrices as Linear Functions

Relearning Matrices as Linear Functions(dhruvonmath.com)

294 points by dhruvp 7 years ago | 93 comments

Linear Algebra, at least at my school, is taught pretty poorly. Instead of teaching the beauty of transformations, the course is boggled down in numerical nonsense and tedious calculations (who wants to find the inverse of a 3x3 matrix? Bueller? Bueller?). Only after learning Algebra and homomorphisms, isomorphisms and automorphisms did I appreciate the importance of linear transformations. Stuff like Singular Value Decomposition gets a lot more interesting once you know some basic Algebra. I suppose Linear can't get too abstract because non math majors have to take it, but starting from generalized ideas of transformations is a far better way to teach it imo.

postsantum 7 years ago | |

That was exactly my experience. Struggled with matrices theory at uni doing some bullshit exercises but started to grasp the topic only when I needed to apply some linear transformation in a game

datasciencetext 7 years ago | | |

I think the situation has improved somewhat as visualization tools have become easier to use. We made this simple visual [1] to help people understand what they might get out of linear algebra, and it was easy enough for some statisticians to accomplish.

[1]https://datasciencetexts.com/subjects/linear_algebra.html

swiley 7 years ago | |

At mine we have an "applied" linear algebra course for engineering students and the "normal" one that math people take.

I didn't pass the "normal" one but I think that's because I had another 300 level math course, capstone, and four other CS courses at the same time. I'm certain I wouldn't have passed the applied one, it looked very tedious and that's usually what gets me with homework.

andrewla 7 years ago |

It took until I started learning differential geometry in the form of General Relativity to arrive at this insight, even though I feel like the notion of a matrix as a linear map was drilled in pretty thoroughly. The notion of matrix multiplication as function composition was presented almost as an interesting side effect of matrix multiplication -- that is, multiplication by these rules came first, and, hey, look, they compose!

Personally I found the prospect of tensor algebra to be much more intuitive than either of these; with matrices thrown in mostly as a computational device. Even a vector (through the dot product) is just a linear function on other vectors, and the notion of function composition carries through to that and to higher-order tensors.

Covariance and contravariance are a little more complicated to completely grok, but for most applications in Euclidean space (where the metric is the identity function) the distinction is of more theoretical interest anyway.

chadcmulligan 7 years ago | |

I found this series by eigenchris helpful to understand tensors https://www.youtube.com/watch?v=8ptMTLzV4-I&list=PLJHszsWbB6...

ijidak 7 years ago | |

The metric?

throwawaymath 7 years ago | | |

A metric is a distance function. Defining a metric on a space is one of ways you create a topology.

I'm not sure what the parent means by the metric being the identity function, however. The Euclidean metric is basically the hypotenuse of a triangle parameterized by two vectors. The adjacent and opposite sides of the triangle are measured to be the Euclidean norm of each vector (their length), and the hypotenuse is the shortest distance between them.

The Euclidean metric is not the only metric - you can define distance however you'd like as long as it's consistent. But I'm not sure how the identity function works as a metric, because that would map a vector to another vector, not a scalar.

munchbunny 7 years ago |

In my high school matrices were first taught in geometry class, starting with using matrices as affine transformations in 2-d and then 3-d, and using that to teach concepts like what eigenvectors/values are, the equivalence of matrix and function composition, etc.

That was taught right after a unit on complex numbers and trigonometry so that we could see the parallels between composing polynomial functions on complex numbers and composing affine transformations.

To this day I think that was one of the most beautiful and eye opening lessons I've had in mathematics.

In hindsight, I think I got lucky that the teachers who wrote the curriculum this way were math, physics, and comp sci masters/phd's who looked at their own educations and decided that geometry class was a great Trojan horse for linear algebra.

rramadass 7 years ago | |

You certainly were lucky to be taught Linear Algebra in such a manner! I came to understand the importance of such an approach only after a lot of head-scratching and self-study. IMO, a beautiful and important branch of "Practical" Maths has been needlessly obscured by the pedantic formalism espoused by the teaching community. Linear Algebra SHOULD always be taught alongside Coordinate/Analytic Geometry and Trigonometry for proper intuition.

I found the book "Practical Linear Algebra: A Geometry Toolbox" very helpful in my study.

whatshisface 7 years ago |

FWIW, I was told that matrices are linear maps pretty early on in my education. Are there any college level linear algebra / matrix calculations courses that don't tell students about that?

dhruvp 7 years ago |

Hey OP here!

When I first was introduced to matrices (high school) it was in the context of systems of equations. Matrices were a shorthand for writing out the equations and happened to have interesting rules for addition etc. It took me a while to think about them as functions on their own right and not just tables. This post is my attempt to relearn them as functions which has helped me develop a much stronger intuition for linear algebra. That’s my motivation for this post and why I decided to work on it. Feedback is more than welcome.

michelpp 7 years ago |

This is great and a nice mathematical approach to the ideas of matrices. Another great resource is 3blue1brown's essence of linear algebra:

https://www.3blue1brown.com/essence-of-linear-algebra-page

Math is Fun also has a nice writeup that explain matrix multiplication from a real world example of a bakery making pies and tracking costs:

https://www.mathsisfun.com/algebra/matrix-multiplying.html

noobermin 7 years ago |

One of the things that always irked me about the term "linear transformation" is it doesn't include affline transformations, which is funny because back in elementary school, you learn that a "linear equation" looks like Mx + b. Of course, the article states the term "linearity" when talking vector spaces (or modules) means linearity in arguments, while the term linear for a child in school means "something like a line on graph paper", and this is yet another example of terminology in the way mathematics is taught, possibly for historical reasons, that leads to even more confusion.

PS. incase you didn't know, affline transformations are not linear:

  f(x) = mx + b =>
  f(x+y) = m(x+y) + b /= mx+b + my+b = f(x) + f(y),
  f(cx) = c m x + b /= c(mx + b) = c f(x)

tptacek 7 years ago |

Their most recent post about kernels is even better than this:

https://www.dhruvonmath.com/2019/04/04/kernels/

The matrix/function stuff is elementary enough that I understand it intuitively (I suck at math), although it's neat to be reminded that given a enough independent points you can reconstruct the function (this breaks a variety of bad ciphers, sometimes including ciphers that otherwise look strong).

The kernel post actually does some neat stuff with the kernel, which I found more intuitively accessible than (say) what Strang does with nullspaces.

adenadel 7 years ago |

If you're interested in this approach to linear algebra you should read Linear Algebra Done Right by Sheldon Axler.

avip 7 years ago | |

Or pretty much any other Linear Algebra book.

adenadel 7 years ago | | |

I guess the distinction (in my mind) is the perspective that Linear Algebra Done Right takes in that they don't focus on matrix representations.

_v7gu 7 years ago | | |

Axler's book has the advantage of skipping determinants in order to provide a more intuitive approach to linear algebra.

meuk 7 years ago |

It recently occurred to me that if you use that matrices represent linear functions, you don't have to do tedious math to prove that matrix multiplication is associative (that is, (A * B) * C = A * (B * C), which allows us to write A * B * C without brackets, since it doesn't matter how we place the brackets anyway).

For a matrix M, denote f_M(x) = M * x. Then f_{A * B} = f_A(f_B(x)) so that f_{(A * B) * C} = f_{A * B}(f_C(x)) = f_A(f_B(f_C(x))) and also f_{A * (B * C)} = f_A(f_{B * C}(x)) = f_A(f_B(f_C(x))).

So f_{(A * B) * C} = f_{A * (B * C)} = f_A(f_B(f_C(x)))

adamnemecek 7 years ago |

Conjugate transpose and other adjoints are kinda nuts, they are the other part of the story

http://www.reproducibility.org/RSF/book/bei/conj/paper_html/...

Esp the ray tracing/topology relationship is nuts.

ivansavz 7 years ago |

Nice! The illustrations + color coding for the vectors are very useful.

Here is a video tutorial that goes through some of the same topics (build up matrix product from the general principle of a linear function with vector inputs): https://www.youtube.com/watch?v=WfrwVMTgrfc

Associated Jupyter notebook here: https://github.com/minireference/noBSLAnotebooks/blob/master...

Jun8 7 years ago |

Good, intuitive introduction to matrices. Next steps could be showing that there are infinitely many different matrix representations of a linear map (different from the polynomials) and they can be used for function spaces, too.

One question that usually pops up that I was confused about till recently: are rank two tensor equivalent to matrices? Answer is no, e.g. see here: https://physics.stackexchange.com/questions/20437/are-matric...

dhruvp 7 years ago | |

Hey!

Thanks for the feedback. I go into this in the next post on eigenvectors here: https://www.dhruvonmath.com/2019/02/25/eigenvectors/. I start by discussing basis vectors which I believe is what you’re looking for in your comment.

S4M 7 years ago |

I just skimmed the article quickly. Are there other ways to learn about matrices? If you don't treat them as linear applications, they are just boring grids of numbers and the matrices multiplication doesn't make any sense.

thegabriele 7 years ago | |

Which is precisely how they were presented to me at college.

sytelus 7 years ago |

The basic equivalency is fine but what about all other things you can do with matrices but can’t do with functions? For example, what is the equivalent of transpose in functions? How about Eigen values or Gaussian elimination?

zwieback 7 years ago |

Nice article. That's how I learned matrices in high school in Germany. Maybe it's different here in the US, I'll have to take a look at my daughters' textbooks.

a_t48 7 years ago | |

They were in the textbook in my high school, but we always skipped that chapter.

kregasaurusrex 7 years ago |

Having not taken a linear algebra course in college, does anyone have a recommendation for a book/course to follow?

rocqua 7 years ago | |

That would heavily depend on whether you are coming at it from a theoretical math p.o.v. or a more applied p.o.v.

Not that the applied approach should leave out the theory, because theoretical stuff like this article give a great and intuitive understanding of linear algebra. However, the more theoretical treatments should set up things like rings, modules, and even category theory that are much less useful from an applied perspective.

For the theoretical approach I've heard good things about 'linear algebra done right'. I imagine it is less appealing for the applied approach. All I can say is be wary of the 'shut up and calculate' mindset in linear algebra. Getting the ideas behind the concepts is essentially a shortcut to understanding linear algebra without any downsides.

AareyBaba 7 years ago | |

Gilbert Strang MIT 18.06 Linear Algebra https://www.youtube.com/playlist?list=PLE7DDD91010BC51F8

Essence of Linear Algebra https://www.youtube.com/playlist?list=PLZHQObOWTQDPD3MizzM2x...

diehunde 7 years ago |

Gilbert Strang uses similar approach on his Linear Algebra lectures. Much more intuitive

mikorym 7 years ago |

The next relearning step is to construct the category where arrows are matrices...

wolfgke 7 years ago | |

> The next relearning step is to construct the category where arrows are matrices...

Why not the category of vector spaces (morphisms are linear maps)?

mikorym 7 years ago | | |

So yes, this is equivalent to FinVect of the field of which entries in the matrices consist.

The difference is that here you construct the category from a simpler premise. To construct FinVect you need to include all set objects with structure satisfying some axioms.

The category of matrices is simply positive integers with as morphisms n x m matrices between the two integers. Composition is matrix multiplication.

Here [1] is a nice overview. If you can follow what is going on there, it is worth while looking at II, III and IV.

[1] https://unapologetic.wordpress.com/2008/06/02/the-category-o...

rocqua 7 years ago | | |

Isn't that the same?

I suppose that technically, the 'arrows are matrices' definition rules out infinite dimensional vector spaces, but I'd guess that OP meant to include them.

An argument against would be to keep to a small category.

enedil 7 years ago | |

Is it Vect?

je42 7 years ago |

this was an important result in the linear algebra class for first year math/cs/eng students at my university.

Grustaf 7 years ago |

What could possibly be a more basic understanding of a matrix in mathematics? There’s a reason the first teach you Linear Algebra before anything else.

j7ake 7 years ago |

Who is "we" in this context?

dhruvp 7 years ago | |

Hey! I wrote this article - “we” is referring to people who had a similar educational experience to me. I was introduced to matrices as a tool for solutions to systems of equations. I always wish I was taught the functional perspective from the beginning.

luckydata 7 years ago | |

me for example. My teachers in college did a real shit job at teaching this subject.