Math-as-Code (2015) (github.com)

161 points by yinso 5 years ago | 128 comments

Koshkin 5 years ago |

Mathematical notation is great at facilitating formal manipulations. This is its critical feature, and without it we would be stuck at the level of ancient mathematics. This is the reason it was invented a few hundred years ago in the first place. That said, I find that notation is often abused in texts as a mere substitute for normal human language. While this compresses the text, it does nothing to help the reader better understand what is being said; instead it looks like a crazy mess of characters and other marks in a multitude of fonts, styles and sizes, the only purpose of which seems to be to cause eye strain.

LeanderK 5 years ago | |

While I understand your point, I often get frustrated when there's text that should have been maths. It is very easy to skip details which are then hard to pin down when you try to translate it into mathematical notation. It can become a real problem.

Many fields have over time developed the mathematical notation that's most suitable to them, a notation where irrelevant details are just skipped over. E.g. with neural networks we don't explicitly carry our implicit parameters around; it would be very noisy to have thetas everywhere. Statistics/probability has other common notation, e.g. the usual symbols for logical and/logical or meaning min/max. One needs to learn how to read it, but that's just practice.

Regarding your comment about notation as a substitute for normal human language, often it's just easier than prose. You can of course explain something in normal language, but what's a quick equation or formal statement might become a whole paragraph in English. If you're comfortable with mathematical notation it's just faster to read, easier to understand and more "specific" than prose. It only looks like a crazy mess of characters if you can't connect the dots; it becomes "obvious" very quickly.

I always try to formulate mathematical ideas in mathematical notation first. It's way easier to spot weaknesses or details you've not considered compared to prose.

mrfusion 5 years ago | |

I get most frustrated when they don’t tell you what all the variables/constants are. I feel like that happens a lot.

pfortuny 5 years ago | |

Exactly. Good maths does not use extensive “notation” per se, only when it is really necessary. Any good treatise has many more paragraphs of carefully written text than of equations: these are only used when they are truly required.

enriquto 5 years ago | | |

Moreover, mathematical notation is very universally agreed upon. You can read a math paper in a language that you do not understand and just by looking at the formulas you can get quite a good idea of its contents, methods and proofs. For code, the situation is dire: each programming language has a particular, different, and often limited way to express math.

rory_isAdonk 5 years ago | |

Anyone have a good resource on learning mathematical notation?

andrepd 5 years ago | | |

There's no such thing as "learning mathematical notation". You learn whatever field you are interested in, and the text you are reading will tell you what the notation is, as "we'll denote multiplication of `a` and `b` by `ab`", "we'll denote `A` implies `B` by `A => B`", "we'll denote the derivative of `f` wrt `x` with `df/dx`", etc.

mkl 5 years ago | | |

If you want to just learn notation and you're starting from scratch and know how to code, the linked article fits that description.

However, notation for its own sake is not really something I'd suggest sitting down and learning. If you have some maths you want to learn, the notation will come along with that, so any book or article that introduces the maths you want will also introduce the notation (usually building on some foundation level of existing knowledge, e.g. high school or whatever).

ImaCake 5 years ago | | |

I find searching google/ddg to be a good place sometimes. Wikipedia has a list of mathematical symbols I sometimes consult (also includes latex code!) [0]. You could also try Khan Academy videos, which are often helpful.

0. https://en.wikipedia.org/wiki/List_of_mathematical_symbols

BiteCode_dev 5 years ago |

I regularly hear completely opposite opinions:

- the ones that wish programming would be more elegant and expressive like maths notations

- the ones that wish papers would publish a Python algo instead because maths notations are inconsistent and hard to read

I'm not a maths person, so can people with a lot of experience with it help me decide which one is the more reasonable?

augustt 5 years ago |

To be completely honest, I find it hard to believe when people claim that the reason they never 'got' mathematics was because of the notation. Actually understanding the concepts will almost always be significantly harder than understanding notation.

throwaway_pdp09 5 years ago | |

Notation can be difficult. Given my years with logic I can now read a reasonably complex piece of it and hope to interpret it quite easily if I have some feeling for where the author's going. It does take practice but you need to be exposed to it a fair bit (as when learning a natural language) and have someone to ask, to talk it through with you, until you find your feet. Without some coaching from my uni lecturers at first, and a very good book as well, I probably would have given up.

doubleunplussed 5 years ago | |

There is a natural tendency to avoid learning the notation. Even though it would be easy, it goes against our instincts to intentionally go out of our way to look up notation we don't understand. With natural language one often picks up the meaning of new words based on context, after a bit of repetition. So it does not come naturally to people to do otherwise.

throwaway_pdp09 5 years ago | | |

> to look up notation we don't understand

Not so easy. I was trying to find out which of NN and ZZ was what; try searching for those. If you don't know what the sigma (summing) sign means, how are you going to find it?
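For what it's worth, the sigma sign is essentially a for loop that adds things up, which is the kind of mapping the linked article draws. A minimal Python sketch (the name `sigma` and its parameters are made up here for illustration, they are not from the article):

```python
def sigma(lower, upper, term):
    """Sum term(i) for i = lower, ..., upper, with both bounds inclusive."""
    total = 0
    # Mathematical bounds are inclusive; range() excludes its end, hence the + 1.
    for i in range(lower, upper + 1):
        total += term(i)
    return total


# Sum of i for i = 1..100
print(sigma(1, 100, lambda i: i))     # 5050
# Sum of i^2 for i = 1..3, i.e. 1 + 4 + 9
print(sigma(1, 3, lambda i: i ** 2))  # 14
```

If the upper bound is below the lower bound, the loop body never runs and the result is 0, which matches the usual convention for an empty sum.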

UK-Al05 5 years ago | |

When I was young, I remember mathematical notation being intimidating when getting started.

_hardwaregeek 5 years ago |

I highly recommend reading The Little Typer if you want a great book that bridges math and code. It starts out describing Pie, a Lisp with some interesting restrictions (limited recursion, types can have values, etc.). They build up some cool stuff like vectors with the size encoded in the type. All of a sudden, they explain that equality is a type, and any value of said type is a proof! Turns out you can think of many proofs as manipulating data structures to get a value of a certain type.
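To make those two ideas concrete, here is a rough sketch in Lean 4 rather than in the book's own language, so the syntax differs from what The Little Typer uses. The names `Vec` and `Vec.head` are chosen for this example only.

```lean
-- A vector whose length is part of its type.
inductive Vec (α : Type) : Nat → Type where
  | nil  : Vec α 0
  | cons : {n : Nat} → α → Vec α n → Vec α (n + 1)

-- `head` only accepts vectors of length n + 1, so the empty case
-- cannot occur and needs no runtime check.
def Vec.head {α : Type} {n : Nat} : Vec α (n + 1) → α
  | .cons x _ => x

-- Equality is a type; `rfl` is a value of that type, i.e. a proof.
example : 2 + 2 = 4 := rfl
```

The last line is the "any value of said type is a proof" point: the type `2 + 2 = 4` is inhabited by `rfl`, whereas no value can be constructed for a false equation such as `2 + 2 = 5`.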

I wonder how long until we get a somewhat mainstream language with pi types. I know Rust considered adding them. And I recently learned that Rust does allow for quantification over lifetimes^[1]. I could certainly see a language that implements dependently typed arrays. Midori for instance looked into eliding bounds checks with compiler proofs^[2].

[1]: https://doc.rust-lang.org/beta/nomicon/hrtb.html

[2]: http://joeduffyblog.com/2016/02/07/the-error-model/

gergoerdi 5 years ago | |

I think at this point, Haskell is the most likely to become the first mainstream PL with Pi types: https://gitlab.haskell.org/ghc/ghc/-/wikis/dependent-haskell

somethingsome 5 years ago | |

You should take a look into Lean Theorem Prover (among others) ;)

eterps 5 years ago |

I also liked how the Fortress programming language enabled you to switch between regular syntax and math notation:

https://www.zdnet.com/article/guy-steeles-new-programming-la...

The language has faded into obscurity though.

zorked 5 years ago | |

Mathematica has that, though the math notation is more about display than input.

henrikeh 5 years ago | | |

Proper mathematical notation is absolutely for the input. Maybe not integrals and sums (which are a bit unclear), but fractions, powers and the more specialized notation is quite nice for readability.

eterps 5 years ago | |

There aren't many screenshots available, only a couple of lo-res images on Google image search: https://www.google.com/search?q="fortress" programming language mathematical&tbm=isch

thelazydogsback 5 years ago |

I think the clear answer is that you want both -- maths notation, and at least one implementation in a non-specialized programming language with clear functional/procedural semantics. If you understand one or the other, then you can also learn the mapping between the two. This is also important to remove ambiguities in notation, make errors in each more obvious, and aid in reproducible results. Of course, applicable input and output data also need to be supplied for verification. If the code is too long to publish in a paper (usually there is at least some core idea that can be expressed) then it should be in GitHub or elsewhere at a stable URI -- papers often refer to academic sites that 404 soon after.

As a non-mathematician who has recently been looking at papers in journal back-issues from about 1970 to 2010, I certainly would have benefited from this.

On a related note, another issue is that maths must be implicitly ordered in the context of the prose of the paper, while programs have actual entry-points and (without nitpicking) explicit ordering. (It's possible that ordering can be relaxed, but correctness is preferred over runtime cost.)

I think "maths-as-data" is more important here -- use a parsable common notation with enough meta-data that I can view it any way I want -- as math-with-greeks, math-with-friendly-names('en-us'), as APL, as plain Python, Python with numpy, etc.

Schiphol 5 years ago |

I thought this was going to be about a math textbook (the ideas, not the notation) that relied on previous proficiency with code. [Category Theory for Programmers](https://bartoszmilewski.com/2014/10/28/category-theory-for-p...) meets this description. Is anyone familiar with any other good examples?

sohamsankaran 5 years ago |

I would pay a fairly large chunk of my income to someone working on this kind of alternate representation of mathematics full-time. Email me at soham [at] soh.am if you're interested.

zozbot234 5 years ago | |

The closest thing to this kind of alternate representation is formal mathematics, as seen in systems like Mizar, HOL, Coq, Lean, Isabelle and the like. The fact that it generally "looks like computer code" is often seen as a drawback, but it does have its advantages. In fact, some of these systems allow for constructive theories, which means that they are programming languages of a sort.

sohamsankaran 5 years ago | | |

I think there's a place for alternate representations like these outside of proofs that need to be exhaustively machine-checkable. I have, for what it's worth, had far better experiences with Coq and the like than with mathematics in general.

elbear 5 years ago | |

I'm curious, what's your motivation for wanting this? I'm asking because:

a) I'd be interested in your offer

b) It's not clear what this alternate representation would look like. I'm hoping that your possible use cases would shed some light on that

stared 5 years ago |

I've found that PyTorch is the smoothest for math <-> programming (at least when it comes to vector-like calculus).

See for example gradient descent in maths (LaTeX) and in PyTorch: https://colab.research.google.com/github/stared/thinking-in-...
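As a rough illustration of the same correspondence without any library: the update rule x_{t+1} = x_t - lr * f'(x_t) written in plain Python. This is not the code from the linked notebook; the function name, the learning rate and the quadratic objective are assumptions picked for the example.

```python
def gradient_descent(grad, x0, lr=0.1, steps=100):
    """Repeat x <- x - lr * grad(x) a fixed number of times and return x."""
    x = x0
    for _ in range(steps):
        x = x - lr * grad(x)
    return x


# Hypothetical objective: f(x) = (x - 3)^2, whose derivative is f'(x) = 2 (x - 3).
x_min = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
print(x_min)  # close to the minimizer x = 3
```

In PyTorch the derivative would come from autograd instead of being written by hand, but the loop body has the same shape as the formula.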

enriquto 5 years ago | |

I have the contrary impression. Fortran and matlab/octave are great for numeric math. Julia is also very good. The python math stuff (either torch or numpy) is always an unreadable mess, you are always fighting against the language. The matrix product and exponentiation operators look like a bad joke. I do not understand how people can take this stuff seriously.

stared 5 years ago | | |

To some extent, it is a matter of taste. Personally, I love functional notation (technically, in PyTorch it's chaining) e.g.:

x.matmul(y).pow(2).sum()

This way we can write a lot of things, and we don't need to make up a new combination of punctuation marks and special characters for an operation.

For example, while one can write in Python:

torch.sum((x @ y) ** 2)

I consider it less readable. I mean, here maybe it is fine, but once it gets longer, more complicated, or we want to add new operations, it turns into a mess.

Vide "style" section in: https://github.com/stared/thinking-in-tensors-writing-in-pyt...

dunefox 5 years ago | |

While I like PyTorch very much I have found Julia + Flux to be even better.

stared 5 years ago | | |

Could you show some examples?

I did try to use Julia quite a few times (including, I don't know 8 years ago), as I loved the philosophy (brief yet fast, types). Sadly, I never considered it readable - a mix of new concepts, legacy MATLAB syntax, and in general disregard for this part (even function names didn't have consistent naming).

If it has moved a lot in that respect, I would be happy to see some nice examples.

analbumcover 5 years ago |

Homotopy type theory is a programming language whose original purpose was "math-as-code". Gaining widespread adoption among mathematicians, let alone programmers, seems unlikely as mathematicians dislike programming languages for their perceived lack of elegance and are firmly set in their set theoretic ways, while most programmers balk at the barest hint of concision and rigor.

gergoerdi 5 years ago | |

HoTT isn't a programming language, because there are non-value normal forms. That's the whole reason behind research into various formulations of Cubical Type Theory, which is a programming language.

analbumcover 5 years ago | | |

Intensional intuitionistic type theory is a programming language. Throw in higher inductive types (still a programming language at this point) and Voevodsky's univalence and you've got HoTT. Then sure, the simplicial model is not constructive, whereas the cubical models give computational meaning to univalence, but they're still just that, models of HoTT. So would you prefer I had said HoTT has models that are programming languages?

rantwasp 5 years ago | |

programming is a special applied type of math.

ilaksh 5 years ago |

This is really cool, but where I get really lost is not on the more basic stuff covered here, but the more advanced math that shows up frequently in ML papers for example.

It seems like there is a lot of stuff in math that is kind of like code libraries or functions in programming, except you are supposed to just remember exactly how it works rather than having the source code.

winrid 5 years ago |

I love this kind of stuff. Following this thread for books.

Currently I'm doing Data Science From Scratch except in C++ instead of Python.

https://github.com/winrid/data-science-from-scratch-cpp

vthommeret 5 years ago |

I plan to create a Show HN shortly, but I created an interactive Python tutorial to teach engineers how to read and implement math using the NumPy library, called Math to Code:

https://mathtocode.com

It takes you from basic functions like square roots and absolute values, to summations, matrix multiplication, to measures like standard deviation and the Frobenius norm.

I was inspired to create it while taking the Fast.ai course and Jeremy Howard showing what looked like a "complicated" Frobenius norm equation that could be implemented in a single line of Python.
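For readers who have not seen it, the Frobenius norm is the square root of the sum of the squared absolute values of all entries of a matrix. A sketch in plain Python (not the NumPy version used in the tutorial or the course; the function name is made up here):

```python
import math


def frobenius_norm(A):
    """Frobenius norm of a matrix given as a list of rows."""
    # Double sum over rows i and entries j, then a square root.
    return math.sqrt(sum(abs(a) ** 2 for row in A for a in row))


A = [[1, 2], [3, 4]]
print(frobenius_norm(A))  # sqrt(1 + 4 + 9 + 16) = sqrt(30)
```

The two nested sums in the formula become the two `for` clauses of the generator expression, which is why it fits on one line.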

It's open source and should also work on mobile.

gnramires 5 years ago |

If anyone is seriously interested in doing mathematics on the computer, give Sage a try. It gets rid of many machine-particularities, like floats not being real numbers (due to limited precision). You can easily represent and solve equations, do calculus, integration, etc. And it supports a python-like programming too.

So you can write a for loop which solves an equation with different parameters at each iteration, for example. You can write logical proofs too, although that's not the main purpose. (I think doing calculations on your computer and generally being a superb Math assistant is where it's at).

dependenttypes 5 years ago | |

> It gets rid of many machine-particularities, like floats not being real numbers (due to limited precision).

I will have to doubt that for the simple fact that it is impossible to have a real number type on a computer.

gnramires 5 years ago | | |

Real numbers are represented as abstract data structures, not as infinite series of digits (although that indeed fits in a Turing machine? :p)

So for example, '2' is recognized as an integer and is represented using arbitrary size integers. You can also however write 'x = sqrt(2)' (or just 'sqrt(2)'), which has no finite decimal expansion (it is irrational); 'x' is a real number. You can then ask for finitely many digits of x, with 'x.n(digits=5)' (gives 5 digits), or write something like 'y=x^3+3*x-4', which gives another real number, represented as this polynomial data structure itself.
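To illustrate the idea of keeping the number exact and producing digits only on request, here is a small Python sketch using integer arithmetic. It is not how Sage works internally; the function name and approach are assumptions for the example.

```python
from math import isqrt


def sqrt_digits(n, digits):
    """Return sqrt(n) truncated (not rounded) to `digits` decimal places.

    Only integers are used, so no floating-point precision limit applies.
    """
    # floor(sqrt(n) * 10^digits) equals isqrt(n * 10^(2 * digits)).
    scaled = isqrt(n * 10 ** (2 * digits))
    whole, frac = divmod(scaled, 10 ** digits)
    return f"{whole}.{frac:0{digits}d}"


print(sqrt_digits(2, 10))  # 1.4142135623
print(sqrt_digits(2, 50))  # 50 decimal places, beyond what a float can hold
```

The "number" here is just the integer 2 plus the knowledge that we mean its square root; any finite number of digits can be computed from that exact description.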

The only problem with this is irreducibility. You can compose arbitrarily many operations on floats and still get a float of the same size. With this approach, it may not be possible to simplify a series of operations so the representation can grow unbounded.

edit: Fun fact, there are (real) numbers that indeed cannot be represented in a finite computer no matter what -- but they cannot be represented on paper or uniquely represented in any finite abstract form either! This follows from the pigeonhole principle: finite expressions may represent numbers, but there are uncountably many (2^(aleph_0)) real numbers, and only countably many expressions. So indeed almost every real cannot be represented. You can think of those as not being identifiable with any property, so there's no finite expression to describe them. They're more or less random.

dang 5 years ago |