Math Basics for Computer Science and Machine Learning [pdf] (cis.upenn.edu)

883 points by oldgun 6 years ago | 118 comments

The professor who wrote this is Jean Gallier, and I had him for advanced linear algebra at Penn. I am also pretty close to him insofar as a student can be close to a professor. On a personal note, he is one of the funniest professors I've had, and all math professors are characters.

For the people who are interested in ML, the thing to remember here is that he is a Serious mathematician, and he values rigor and in-depth understanding above all. A lot of his three-star homework problems were basically impossible. He writes books first and foremost so he can understand things better. In math books, there's the book you first read when you don't understand something, then the book you read when you understand everything. This is the book in the link.

For linear algebra, this: https://www.amazon.com/Introduction-Linear-Algebra-Gilbert-S...

jimbokun 6 years ago | |

As someone who never took an undergraduate linear algebra course, videos of Strang's lectures got me through a couple of graduate machine learning courses.

https://ocw.mit.edu/courses/mathematics/18-06-linear-algebra...

What he does with chalk and a blackboard is far more effective than anything done today with Powerpoint, fancy computer animations, or what have you.

xiaolingxiao 6 years ago | |

Apparently, you guys are overloading the university's server! Nice job guys!!

Gene_Parmesan 6 years ago |

From the start of Chapter 2:

"In the following four chapters, the basic algebraic structures (groups, rings, fields, vector spaces) are reviewed, with a major emphasis on vector spaces. Basic notions of linear algebra such as vector spaces, subspaces, linear combinations, linear independence, [...], dual spaces, hyperplanes, transpose of a linear maps, are reviewed."

If anyone needs to start even earlier than this, I've actually found "3D Math Basics for Graphics and Game Development" to be a good true intro for linear algebra-related stuff. I think this would probably hold even if your primary interest is something other than graphics/game dev. Some of the text in that book's intro is a little cringey with its reliance on kind of juvenile game references, but I didn't find that sort of writing continuing during the actual text. So just push past that stuff.

I got a copy of it to act as a refresher before diving into Real-Time Collision Detection since it's been quite a long time since formal math for me (as in, high school, because I'm self-taught in CS). I've managed to make up a lot of ground by working hard and finding classes to audit online (Strang's linear alg course on OCW is a good one), but I have found that depressingly few math texts which claim to be "introductory" are actually truly introductory.

This isn't a slight against the linked work; I absolutely love when profs make resources such as this freely available.

"How to Prove It" and "Book of Proof" are also great intros to formal math, if less immediately practical.

hx2a 6 years ago | |

> If anyone needs to start even earlier than this, I've actually found "3D Math Basics for Graphics and Game Development" to be a good true intro for linear algebra-related stuff.

Did you mean to write "3D Math Primer for Graphics and Game Development" [1]? If you did, I agree 100%. I got a lot out of this book and was able to put it to good use for several projects.

[1] https://www.amazon.com/Math-Primer-Graphics-Game-Development...

codesushi42 6 years ago | |

I would disagree about the gamedev book reference, unless you are referring to the real basics of linear algebra.

The really important concepts for ML are least squares, eigenvalues and eigenvectors, and SVD. Those concepts are not very relevant to game programming.

Well, least squares can be solved with projection, which is relevant for converting between coordinate spaces. But game dev isn't going to give you that intuition.
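To make the projection remark concrete, here is a minimal sketch in plain Python (no libraries). It fits y ≈ a + b*x by solving the 2x2 normal equations, then checks that the residual is orthogonal to both columns of the design matrix, which is what "projecting y onto the column space" means. The data points and the helper names fit_line and residual are made up for illustration.

```python
# Least squares as orthogonal projection, in plain Python.
# Model: y ≈ a + b*x, i.e. A w ≈ y with A = [ones, xs] and w = (a, b).

def fit_line(xs, ys):
    """Solve the 2x2 normal equations (A^T A) w = A^T y by Cramer's rule."""
    n = len(xs)
    sx = sum(xs)
    sxx = sum(x * x for x in xs)
    sy = sum(ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    det = n * sxx - sx * sx  # zero only when all xs are identical
    if det == 0:
        raise ValueError("xs must contain at least two distinct values")
    a = (sxx * sy - sx * sxy) / det
    b = (n * sxy - sx * sy) / det
    return a, b

def residual(xs, ys, a, b):
    """Component of y left over after projecting onto span{ones, xs}."""
    return [y - (a + b * x) for x, y in zip(xs, ys)]

# Illustrative data (an assumption, not from the thread).
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 4.0, 8.0]

a, b = fit_line(xs, ys)
r = residual(xs, ys, a, b)

# Orthogonality check: the residual has zero dot product with each column of A.
dot_ones = sum(r)
dot_xs = sum(x * ri for x, ri in zip(xs, r))
print(a, b, dot_ones, dot_xs)
```

Both dot products come out as zero up to floating-point rounding, which is the geometric content of the normal equations.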

Wurdan 6 years ago | | |

I believe the person you're replying to was attempting to help people like me, who haven't had a math lesson since leaving high school (in my case over 16 years ago) and whose level of math is roughly "You want me to multiply something? Let me get my phone."

So while the book in question might not be the best resource, it probably is a better starting point than the linked doc.

commandlinefan 6 years ago | |

Not to look a gift horse in the mouth, but I'm always irritated by math books that include practice exercises but no answers in the back of the book to check your work against.

bobby358 6 years ago | | |

I believe they do this because the real target audience is other professors. The author wants those professors to use their book for their own courses, and a bunch of problems with no answers saves a lot of time. There seem to be very few books written with the autodidact in mind. Sometimes you can find the solutions manual via eBay or torrent though.

Liquix 6 years ago | |

Do you have a link to that book perchance?

jointpdf 6 years ago |

“Math Basics” is quite the misnomer—it gives the impression that one would need to study all of the contents of this book to be an effective practitioner in CS or ML. Memorizing every definition and theorem in this book would be neither necessary nor sufficient for that purpose.

Keep in mind it can take an hour, and sometimes way more, to really absorb a single page of a math book like this (do the math). This is more of a reference text.

melodrama 6 years ago |

Great book.

I think it's a good time to mention a couple of nice books (related)

1. Elementary intro to math of machine learning [0]. Its style is a bit less austere than that of OP's. It also has a chapter on probability. It could possibly serve as a great prequel to the book linked in the OP.

2. The book on probability related topics of general data science: high-dimensional geometry, random walks, Markov chains, random graphs, various related algorithms etc [1]

3. Support for people who'd like to read books like the one linked in the OP, but have never seen any kind of higher math before [2]. This book has a cover that screams trashy book extremely skimpy on actual info (anyone who reads a lot of tech books knows what I am talking about), but surprisingly, it contains everything it says it does and in great detail. Not even actual math textbooks (say, Springer) are usually written with this much detail. The author likes to add bullet-point-style elaboration to almost every definition and theorem, which is (almost) never the case with the gazillions of books usually titled "Abstract Algebra", "Real Analysis", "Complex Analysis" etc. Some such books sometimes attach words like "friendly" to their title (say, "Friendly Measure Theory For Idiots") and still do not rise to the occasion. Worse yet, a ton (if not most) of these books are exact clones of each other with different author names attached. The linked book doesn't suffer from any of these problems.

[0] Mathematics For Machine Learning by Deisenroth, Faisal, Ong

https://mml-book.github.io/book/mml-book.pdf

[1] Foundations Of Data Science By Blum, Hopcroft, Kannan

http://www.cs.cornell.edu/jeh/book%20no%20so;utions%20March%...

[2] Pure Mathematics for Beginners: A Rigorous Introduction to Logic, Set Theory, Abstract Algebra, Number Theory, Real Analysis, Topology, Complex Analysis, and Linear Algebra by Steve Warner

https://www.amazon.com/Pure-Mathematics-Beginners-Rigorous-I...

iamcreasy 6 years ago | |

I'll check out the last one. Currently I am teaching myself real analysis.

SKILNER 6 years ago |

Very first sentence of 2.1 is full of notation, symbols and terms that I, as a prospective student, might not understand.

So many teachers seem incapable of stepping outside their sphere of knowledge and seeing what they know and others do not. And so much work went into this.

merlinsbrain 6 years ago |

I love that they have problems you can solve as well at the end of (almost) every chapter.

This IS a lot of math (1,962 pages) and it’s missing a preface/introduction which would have been helpful to understand if I need to go linear or if a la carte is okay. At the moment I’d assume each major section is independent.

Awesome find! Wonder how it’s used. (One of) the author(s) seems pretty prolific too - http://www.cis.upenn.edu/~jean/

piggybox 6 years ago | |

Every time I come across a new book on HN, I feel more needs to be done in the rest of my life :)

Koshkin 6 years ago | |

> a la carte

Yeah, I wish we had an online resource (other than Wikipedia) anyone could learn any sort of math from in a systematic way... Oh well.

MikeTheGreat 6 years ago | | |

I'm wondering - is this a genuine request, or a snarky, implicit reference to an online resource for learning math somewhere?

I'd love to know about the existing resource, if it exists. (The only thing that comes to mind is Wolfram Alpha, which didn't seem 'systematic' the last time I skimmed the main page)

gonza 6 years ago | | |

The way I'm trying to learn math right now is with https://ocw.mit.edu; from there you can look up the CS or Math programs to progress to other courses.

bigred100 6 years ago | | |

https://courses.maths.ox.ac.uk/overview/undergraduate#37105

Oxford's course material provides some structure for the interested user.

graycat 6 years ago | | |

What would you like to know? Ask and here you might receive ....

meuk 6 years ago |

Whoa, this covers a lot. I was expecting some linear algebra, calculus, and discrete math, but there's actually some stuff in there I don't know after doing a master's in math.

djaychela 6 years ago | |

That makes me feel somewhat better - I saw the title of 'Maths Basics' and thought 'Great!'... then I saw it's 1,962 pages - if that's the basics, how much is the intermediate and advanced bit?

raxxorrax 6 years ago | |

Thought the same. 2000 pages - Basics was probably an understatement...

Just looked at a few pages and it seems really illustrative. I am just a light-weight mathematician as a computer scientist, but I really would have liked such a comprehensive script for studying. I hate it when profs reduce everything to minimal definitions and expect students to make sense of it. There are countless books but it is always a gamble that they focus on the topic at hand and don't suffer from the same problems.

This even gives you "motivational examples" which are extremely helpful for comprehension in my opinion.

abhisuri97 6 years ago |

I love Professor Gallier! He's an incredible person.

That being said, this is faaaaaar beyond basics. It'd be more appropriate to call this an incomplete (aiming to be comprehensive) guide to almost everything you need to know in computer science (related to math).

graycat 6 years ago | |

From my look at the table of contents, the book seems short on both probability and statistics.

markus_zhang 6 years ago |

That's almost 2,000 pages of math...I don't know why and how, but somehow I forgot most of the Statistics knowledge I obtained as a graduate student (in Stat) 10 years ago.

I remember that I took an advanced course on Bayesian inference and one course on multivariate statistics (PCA, factor analysis, these kinds of things), and my project was about Bernstein polynomials. That's it...

xenihn 6 years ago | |

You forget complex things that you don't use regularly. Math, spoken languages, written languages, coding...of course you can re-learn it, and re-learning is faster than learning it for the first time.

Based on speaking to my managers in the past, it seems like a year-long lapse is enough for you to lose an incredible amount of retained knowledge/skill. But it's not a permanent loss.

markus_zhang 6 years ago | | |

Yeah agreed, sometimes reading a research paper from the DS team would actually ring a bell somewhere and I know where to look. I'm re-learning Statistics from the bottom up at the moment lol but this 2,000-page book really looks daunting. I'm pretty sure I didn't take any advanced optimization course back in university.

emmanueloga_ 6 years ago |

In basic calculus one can burn countless hours memorizing mechanical rules to derive and integrate different function forms, or one can just plug the function into something like wolfram-alpha and get, for a lot of useful cases, a symbolic answer, or at least some approximate answer for a point or interval.

The point is, understanding integrals and derivatives doesn't require one to memorize all the mechanical rules. Using software to compute those functions can be a huge time saver. No one should go with pen and paper double-checking if that polynomial integral is correct or not!

With a book almost 2000 pages long, I wonder if this book leans more heavily on the mechanical-rules side of math. In my mind, it is the difference between writing a book such that you can write your own wolfram alpha, or writing a book so you can just use it.
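As a small illustration of letting software do the checking, the sketch below applies the power rule to a polynomial exactly (using the standard library's fractions module) and then cross-checks the definite integral with a crude midpoint rule. It is a toy, not a stand-in for a real computer algebra system; the function names and the example polynomial are assumptions chosen for illustration.

```python
from fractions import Fraction

def integrate_poly(coeffs):
    """Antiderivative of sum(coeffs[k] * x**k), with integration constant 0."""
    return [Fraction(0)] + [Fraction(c, k + 1) for k, c in enumerate(coeffs)]

def eval_poly(coeffs, x):
    """Evaluate a polynomial (lowest degree first) with Horner's scheme."""
    total = 0
    for c in reversed(coeffs):
        total = total * x + c
    return total

def midpoint_rule(f, lo, hi, steps=10_000):
    """Approximate the integral of f over [lo, hi] with the midpoint rule."""
    h = (hi - lo) / steps
    return h * sum(f(lo + (i + 0.5) * h) for i in range(steps))

p = [1, 0, 3]            # 1 + 3x^2 (illustrative)
P = integrate_poly(p)    # x + x^3

exact = eval_poly(P, 2) - eval_poly(P, 0)                       # symbolic route
approx = midpoint_rule(lambda x: eval_poly(p, x), 0.0, 2.0)     # numeric route
print(exact, approx)
```

The exact value is 10, and the numeric estimate agrees to several decimal places, which is the kind of double check nobody needs to do by hand.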

krosaen 6 years ago |

Haha 1900 pages on "basics"?

I suspect there are better resources for each topic covered (e.g. Gilbert Strang's books and OCW lectures for Linear Algebra), but it is definitely interesting to peruse and get a sense of relevant topics.

jvehent 6 years ago |

"Math Basics"

It's 2000 pages long....

kouh 6 years ago | |

Chapter 1 can't be more basic, a great summary of what will remain intuitively after reading the whole opus

euph0ria 6 years ago | | |

lol

j7ake 6 years ago |

Even if you were ambitious and managed to read 5 pages per day, every day, it would take you more than one year to read this from start to finish.
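The arithmetic is easy to check. The sketch below uses the 1,962-page count quoted elsewhere in this thread; the helper name days_to_finish is made up for illustration.

```python
import math

PAGES = 1962  # page count quoted elsewhere in this thread

def days_to_finish(pages, pages_per_day):
    """Whole days needed to read `pages` at a fixed daily rate."""
    return math.ceil(pages / pages_per_day)

days = days_to_finish(PAGES, 5)
years = days / 365
print(days, round(years, 2))
```

At 5 pages a day that comes to 393 days, a little over a year.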

TimMurnaghan 6 years ago |

Nice to see wavelets in here - but it's a shame that he seems to be encouraging people to actually use Haar wavelets. They're fine for teaching - but there are usually better choices in real life. Daubechies wavelets are a good default.
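For readers who have not met Haar wavelets, here is a minimal sketch of one level of the orthonormal Haar transform in plain Python: pairwise scaled sums (the coarse approximation) and pairwise scaled differences (the detail), plus the inverse. This is only meant to show why Haar is the usual teaching example; it is not taken from the linked book, and the function names and sample signal are assumptions.

```python
import math

def haar_step(signal):
    """One level of the orthonormal Haar transform on an even-length list."""
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail

def haar_inverse(approx, detail):
    """Undo haar_step: rebuild each pair from its sum and difference."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.extend([(a + d) / s, (a - d) / s])
    return out

signal = [4.0, 6.0, 10.0, 12.0]  # illustrative data
approx, detail = haar_step(signal)
restored = haar_inverse(approx, detail)

# Orthonormality means the transform preserves the sum of squares.
energy_in = sum(x * x for x in signal)
energy_out = sum(x * x for x in approx) + sum(x * x for x in detail)
print(approx, detail, restored)
```

Smoother families such as Daubechies use longer filters than this two-tap average/difference pair, which is roughly why they tend to behave better on real signals.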

floki999 6 years ago |

The writing style of this book i.e. rigorous math notation and proposition/proof presentation is going to put off the great majority of potential CS and ML readers. At almost 2000 pages it sure makes a great door-stop though.

impaktdevices 6 years ago |

Me: Oh, good! I've always been pretty good at math but I want to learn how to make sense of the math I encounter in CS and ML.

[Reads the first paragraph of the 2nd chapter]

Me: I don't know anything about math. At all.

amthewiz 6 years ago |

Basics should be concepts that get you to 80% and tell you where to look for the rest 20%. This book tries to get you directly to 95% and is best treated as a reference book.

laichzeit0 6 years ago |

Strangely, probability theory is completely omitted.

kebman 6 years ago | |

Is there a probable cause?

ps101 6 years ago |

I really can't figure out who the target audience for this book is, if it has a target audience at all.

decotz 6 years ago |

403 forbidden. Can someone host this?

ccffph 6 years ago | |

https://web.archive.org/web/20190730230113/https://www.cis.u...

jaimex2 6 years ago |

I look at this and profoundly thank the people who make ML libraries for the rest of us.

vecter 6 years ago | |

The vast vast vast vast vast majority of this book (which is more of a reference and encyclopedia than an actual book for learning) is not required for implementing most ML libraries.

strikelaserclaw 6 years ago |

This is more like courses a talented undergrad math major would take through 4 years.

estomagordo 6 years ago |

Okay, so this looks potentially awesome. But given that it is a reference work rather than some introductory "basic" little quick read-through, I'd prefer to have it in paper form.

Any hope of that happening?

currymj 6 years ago |

This is an incredible reference for a machine learning researcher who wants to fill in some gaps in their existing mathematical knowledge.

But I would be shocked if this would be of any use for someone trying to learn a little linear algebra in order to play with neural networks. For that I think you still want Strang.

I think "foundations" might have been a better word than "basics" here. "Basics" in any case is not in the printed title, only in the filename.

parasdahal 6 years ago |

403 forbidden, can someone help us out?

itchyjunk 6 years ago | |

It now redirects to google drive. Simply refresh. It redirects to [0].

[0] https://drive.google.com/file/d/1sJvLQwxMyu89t2z4Zf9tD7O7efn...

iserlohnmage 6 years ago |

Can someone recommend me a book on Linear Algebra, Statistics and Probability?

neighbour 6 years ago | |

Second this. If it has exercises, even better. About to undertake my MSc Comp Sci and would like to learn Discrete Math as I'm coming from a non-math background (did B.IT at school and only took two math courses).

bobby358 6 years ago | |

I really like Linear Algebra Done Right by Axler

bigred100 6 years ago |

That’s... quite a lot of math

tempodox 6 years ago |

This book looks great, I've been looking for a resource like that.

mjortberg521 6 years ago |

Great to see Prof. Gallier featured on here!

sgt101 6 years ago |

Math "Basics" in nearly 2k pages!

ForFreedom 6 years ago |

Q: Is this all necessary for ML?

ovi256 6 years ago | |

No, you can be an ML practitioner with just an intuitive understanding of, say, how gradient descent works and you would do fine. You can even pick up that intuitive understanding on a strictly need-to-know basis, when it's needed for learning an ML technique. That's what fast.ai teaches.

For being more than a practitioner, like an implementer of new ML libraries or a researcher, of course you'd need to know more.
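The intuitive picture of gradient descent fits in a few lines. This is a minimal one-dimensional sketch, not how any particular library implements it: repeatedly step against the gradient, scaled by a learning rate. The function name, the learning rate, and the example objective f(x) = (x - 3)^2 are all assumptions chosen for illustration.

```python
def gradient_descent(grad, x0, lr=0.1, steps=100):
    """Minimize a 1-D function given its gradient, using a fixed step size."""
    x = x0
    for _ in range(steps):
        x -= lr * grad(x)  # move downhill, proportionally to the slope
    return x

# Illustrative objective: f(x) = (x - 3)^2, whose gradient is 2 * (x - 3).
x_min = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
print(x_min)
```

With these settings each step shrinks the distance to the minimum at x = 3 by a factor of 0.8, so the result lands very close to 3. Too large a learning rate would overshoot instead, which is the main thing the intuition needs to capture.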

TrackerFF 6 years ago | |

No, but there are some good fundamentals. I.e., the optimization bit for when you're dealing with SVMs, kernel methods, etc. These are all parts that college ML courses cover in depth.

gantkimthis 6 years ago |

Is linear algebra something you need to work with machine learning?

planetabhi 6 years ago |

This is a good book

ppcdeveloper 6 years ago |

This is nice.

manca 6 years ago |

Wow!