Pedagogical Downsides of Haskell

Pedagogical Downsides of Haskell(ciobaca.substack.com)

74 points by scscsc 3 years ago | 110 comments

yamtaddle 3 years ago |

I find its syntax & idiomatic style incredibly difficult to follow, in a way nearly no other languages have been for me, including some functional languages (OCaml doesn't seem nearly as bad to me, for instance).

It's sometimes implied that those who trip over Haskell just aren't big-brained enough to understand various important concepts related to it, but I've found they're usually very easy to grasp, provided the explanation's not using Haskell examples. If all programming were Haskell, I probably never would have become a programmer in the first place. Would have taken me too long to figure any of it out, probably would have concluded I wasn't smart enough to be a programmer.

I do wonder if there are some shared experiences or common patterns to who tends to love Haskell, and those who don't. I also feel nigh-dyslexic trying to read math formulas. Human language and broadly C-family programming languages, on the other hand, seemed easy and natural to me, almost effortless to pick up. Wonder if there's a "mathy"-person versus "languagey"-person divide on finding Haskell legible.

I'm not sure it's the whole thing, but I think I've also figured out that I find algorithm-type reasoning far easier to follow and work with than equations or proofs. Like, the only way I can begin to get traction with an unfamiliar equation is to break down what each term and operation "does" to something "moving through" it—it's tedious as hell. Might be something there.

Hirrolot 3 years ago | |

I find the terminology that Haskell uses quite misleading for software engineering. It borrows concepts from category theory with quaint names such as a "monad", "endofunctor", "catamorphism", etc. The problem is that, instead of a "monad", we can say "brrrdogcogfog" and nothing will change -- the name is absolutely irrelevant to the problem being solved. Given that a monad is an interface for sequential computation, a much better name would be something like "Seq", "SeqComp", or something like that.

consilient 3 years ago | | |

> Given that a monad is an interface for sequential computation, a much better name would be something like "Seq", "SeqComp", or something like that.

Just because you can look at something as describing a computation doesn't mean you always should. For example:

    data BinaryTree x = 
        | Leaf x
        | Node (BinaryTree x) (BinaryTree x)

    instance Monad BinaryTree where

        return :: a -> BinaryTree a
        return x = Leaf x

        bind :: (a -> BinaryTree b) -> BinaryTree a -> BinaryTree b
        bind f (Leaf x) = f x // replace a leaf with the result of calling f on its label
        bind f (Node l r) = Node (bind f l) (bind f r) // traverse down the tree, ultimately replacing all the leaf nodes with a new subtree

You can choose to interpret a binary tree as describing nondeterministic computation where you have two choices at every step, but I rarely do. Most of the time trees are just trees.

truculent 3 years ago | | |

As I understand it, monads help solve the problem of sequential computation in Haskell but the concept is not limited to that. For example, how would you consider the monadic properties of data type like Maybe or Either to be (exclusively?) interpreted through the lens of sequential computation? What about commutative monads where the order doesn’t matter?

pharmakom 3 years ago | | |

F# calls them Computation Expressions which is far more approachable imo

matheusmoreira 3 years ago | | |

> Given that a monad is an interface for sequential computation

And what the hell is an interface for sequential computation? I think I understand what these Maybe types are and what they accomplish but "interface for sequential computation" sounds a lot like those buzzwords people mix together that could mean anything.

keithalewis 3 years ago | | |

Composition/SeqComp comes for free. A monad is how to wrap something up,apply a function to the wrapped up thing, and how to unwrap it.

viscanti 3 years ago | | |

A monad is just a monoid in the category of endofunctors. Would most people know what "brrrdogcogfog" means? Couldn't we make that argument about literally any word? I don't see why it applies more here than elsewhere. For people who have experienced it before, it's straightforward and easy to work with, for those who haven't, then there's a learning curve. No one would likely have encountered "brrrdogcogfog" before and everyone would have to go through the learning curve.

medstrom 3 years ago | | |

Like a Javascript ArrayBuffer? No, a Stream?

throwaway17_17 3 years ago | | |

tl;dr -> I agree that the terminology is probably not something that enhances cohesion amongst devs using Haskell, and certainly can be distracting in a pedagogical setting.

I think the fact your experience leads you to believe that a monad is an interface for sequential computation. A monad is often used for ordering computations, but Haskell’s monads can also be commutative (like the Reader instance of monad) which do not order anything.

The real issue is that the naming convention where some typeclass is named after a concept in category theory means wildly different things to different developers. For instance, I would expect a type/typeclass named for some categorical construct to behave in the way the categorical construct behaves and that would be the extent of what I use it for. However, some developer may see a particular usage the same construct and extrapolate that said construct is intrinsically tied to that algorithmic pattern of usage.

So the problem is controlling expectation and managing consistency throughout the dev community. I doubt Haskell will ever get away from the category theory inspired libraries and the subsequent naming conventions. See the relatively lively development of the profunctor optics based work. But, I can certainly see how it may distract or confuse newcomers.

yoyohello13 3 years ago | |

My original degree was in Math and I can definitely 'feel' a difference when reading Haskell vs other languages.

Writing/Reading Haskell gives me a similar feeling to doing proofs than programming.

Even other functional programming languages don't give me that 'in the math class' feeling that Haskell does.

jokoon 3 years ago | |

I use generators, list comprehension and a lot of lambda stuff with python.

It's a bit fun because it's very short to write, it's concise and it helps a lot working only with dict and tuples etc.

Not sure if it's faster, but it's always a bit longer to write and think about, and I'm not sure it's easier to read and understand.

Sometimes it feels a bit like code golfing, because you can do a lot of things with very few lines.

It's immensely better to remove 99% of side effects, the code is shorter and more compartmentalized, so it's just easier to deal with.

Although I'm doing this alone, and I'm not confident that I could enforce this sort of software design in a team.

alpaca128 3 years ago | |

I'm in the exact same boat. Haskell code feels more like abstract maths and I feel more at home when I can just easily track the data flow. The language and community uses relatively abstract terminology due to its roots and it's just a bit too cryptic to me.

Though I'm glad newer languages are starting to adopt more features from the functional territory for the situations where it just makes more sense.

mrkeen 3 years ago | |

> I'm not sure it's the whole thing, but I think I've also figured out that I find algorithm-type reasoning far easier to follow and work with than equations or proofs.

For me it's the opposite. Once i figure out what an expression is, I do not want it to change on the next clock cycle.

paddw 3 years ago | |

I suspect you are right that there's a type of person Haskell feels very intuitive to. I think if your mind works that way you might have a hard time appreciating the degree of confusion "regular" programmers face when trying to decipher the mess of symbolic soup.

yakshaving_jgt 3 years ago | | |

There is no reason for Haskell to be a "mess of symbolic soup".

You can make Haskell about as human-readable as Ruby if you choose to.

hgsgm 3 years ago | | |

Regular programmers were pretty happy with Perl soup.

javajosh 3 years ago |

The pedagogical downside of Haskell is that it ignores the physical reality of the machine. Physically, a computer is imperative, has mutating state, and is filled with all kinds of possible race conditions. Even after you apply the operating system, allowing processes to live together (and giving you space to define new ones), very few constraints are placed on your program and process space.

Instead of building on this reality, Haskell asserts that the starting point is not physical reality, but rather a mathematical formalism called "The Lambda Calculus", the physical machine is looked at with disdain and pity, its limitations to be worked around to provide the one true abstraction. This is the original sin of Haskell, because it is an attitude that isn't driven by a need to make a thing, but aesthetics and a peculiar intellectual dogma around building that ultimately becomes a stumbling block.

In my view, you have to respect the machine. Abstractions can be beautiful, but they are ephemeral, changeable, unreal. The danger is that these illusions become a siren song to makers who are always looking for better tools, and to these makers the abstractions become realer than the machine. Haskell's power users famously don't actually make anything with it (modulo pandoc and jekyll), and my guess is because either they find that 90% of real-world things you want to do are "ugly" from Haskell's point of view, and so are left as distasteful "exercises for the reader", or they get so distracted by the beauty of their tools they never finish.

In any event, Haskell is a road less traveled for good reason.

mrkeen 3 years ago |

Brilliant write up.

> There is also a school of thought that you should start Haskell by teaching the IO monad first, but I am not convinced: in my experience, if someone gets exposed to IO early on, they will contaminate all their functions with IO. They will essentially end up writing Java in Haskell.

I don't think this is such a bad starting place. Crawling before walking. Purifying an (unnecessarily-) IO function into an ordinary function is a good exercise.

Trying to enforce non-IO from the start would be like enforcing 'no new keyword & factories only' in another language.

mprovost 3 years ago |

This is great and a lot of it rings true to my experience writing a book to teach Rust. It's basically a giant topological sorting exercise to find the optimal order to introduce syntax so that you steer clear of rabbit holes. Or you just end up drawing the owl.

For example, to implement a simple "hello world" program in Rust you have to use a macro (println!), so you can't even look for a function signature in the standard library docs to help. So you can either just say "don't worry about this for now, just trust me" or spend a whole chapter diving into macro syntax. The number of concepts you need to implement a basic program is pretty large and you could easily spend a chapter going into any of them.

Personally I'm not a fan of the approach in this post to just "lie" to people but I do find myself showing a non-optimal implementation because that's all the syntax I've introduced up to that point. Then later I show how to do it better. I know some readers just want the final answer up front though.

glynnormington 3 years ago | |

I provide a dependency diagram so students can work out where to apply most effort and how to catch up if they miss something.

I also show likely dependencies from the course assessment to the various topics. For instance, there is a strong dependency on the IO monad, but a weaker/optional dependency on (general) monads.

In terms of presentation order, I tend to over-simplify early in the course and circle back and make things more precise later.

(I'm teaching a 2nd year university course on Functional Programming with Haskell for the first time, so I found the OP fascinating. Thanks!)

agentultra 3 years ago |

I wonder if there could be a (or already is) a "teaching" Prelude designed for this purpose.

One of the reasons the standard Prelude includes partial functions and specialize versions of `map` and `filter` is to support the pedagogical use-case (as far as I understand the situation). Most production applications will use a custom Prelude of some kind in order to prevent programmers from using foot-guns like `head` or make things more general in the case of `map` and `filter`.

Turns out using linked-lists for everything isn't the best idea but a lot of Haskell applications will use them because it's in Prelude.

Bit of a balancing act supporting both use cases.

jerf 3 years ago | |

I don't know if there is one already, because the Haskell community generally heads in the other direction with its alternate Preludes.

But the effort to fix up the fixable issues mentioned in the post is about the same as writing the post was. Getting it distributed to the students may be a bit harder, depending on the local setup.

But it's definitely fixable with Haskell as it is today.

Linked lists are particularly tricky in Haskell, because as a data structure manifested in memory, they really stink. But as a lazy data structure traversed exactly once and thus just serving as a mechanism for providing "the next thunk", they're fine. Haskell and its laziness completely conflates the two of these, so it ends up being easy to think you have one and end up with the other.

agentultra 3 years ago | | |

Definitely. Linked lists are great for pedagogy and useful in many applications. I think it’s a bit of a sign that the struggle between pedagogy and practice can lead to suboptimal outcomes for both parties.

yakshaving_jgt 3 years ago |

I think Elm is second to none as a tool for learning FP.

It compiles quickly, the guidance offered in error messages are best in class, it's small, and the mental model is consistent.

In fact I think it's far easier to learn Elm (and also perhaps web UI development wouldn't be such a shitshow if programmers earlier in their career used Elm to build their mental model) than it is to learn:

- React

- Redux

- Immutable.js

- Lodash/Ramda

- ES${CURRENT_YEAR}

- Webpack/Parcel/Grunt/Groan/Whatever

- etc…

I've seen so many early programmers go through some React course thinking they've learned FP, and yet struggle to solve basic problems by applying functions to values.

tikhonj 3 years ago |

Code World[1] is a great project that addresses a number of the problems from the article, with an eye towards using Haskell to teach children basic math and programming simultaneously. Code World directly addresses a number of the obstacles outlined in this article:

1. Using an online editor with a rich built-in library removes any toolchain problems.

2. A custom standard library simplifies pedagogically unnecessary details like Foldable

3. The custom standard library also avoids currying (f(a, b) for functions rather than f a b)

4. Custom error messages improve the feedback students get from the compiler

I would highly recommend Code World to anybody looking to teach programming with Haskell. If you want to teach Haskell in a way that fits the existing ecosystem, it's also possible to run Code World without the custom standard library[2].

[1]: https://code.world/#

[2]: https://code.world/haskell#

tome 3 years ago |

I find the go pattern absurd. Which of these is easier to read:

    foldr k z = go
      where
        go [] = z
        go (y:ys) = y `k` go ys

    foldr k z = foldr_k_z
      where
        foldr_k_z [] = z
        foldr_k_z (y:ys) = y `k` foldr_k_z ys

gpderetta 3 years ago | |

The first.

tome 3 years ago | | |

Do you have insight you can share into why you find it that way?

abecedarius 3 years ago | |

How about 'folding'? I've settled on that kind of name for looping/recursing helper functions.

Scheme has a bit of syntactic sugar called "named let" which makes this internal-helper pattern more concise/direct.

ghusbands 3 years ago | |

I think I prefer this:

    foldr _ z [] = z
    foldr k z (x:xs) = k x $ foldr k z xs

tome 3 years ago | | |

I suspect we all prefer that, but the point of abstracting out a closure that captures k and z is for performance.

asplake 3 years ago |

Point 11 surprised me. Not the “go” thing but the “where” syntax – I wish more languages had it!

wnoise 3 years ago | |

Yes. Where is often lovely -- I want to delegate details, and not think about them yet, but keep that delegation scoped to the function that needs the relevant details.

But calling auxiliary functions "go" is almost always bad naming.

chowells 3 years ago | | |

"go" is a fantastic name for communicating that all you're doing is exactly what the containing named definition promises. It's a lot better than adding "Worker" or "Impl" as a suffix of the same name as the parent. It contains no additional information because there's no additional information to contain - the parent name already says it all. So you might as well make it short and a standard idiom.

mrkeen 3 years ago | | |

> But calling auxiliary functions "go" is almost always bad naming.

Calling auxiliary functions "go" is like calling loop variables "i".

jy14898 3 years ago |

PureScript might be worth considering, a few of the downsides listed here aren't in PS, for example: Int/Number primitives aren't overloaded, strict evaluation, the various tools like package management are easy, explicit Prelude means you are free to import foldl from Array for example.

Of course PureScript has it's own downsides not apparent in GHC

Laaas 3 years ago | |

PureScript also has the huge advantage that it's trivial to build "something". When teaching Haskell, I'm never sure what to build as an example. CLI tools aren't attractive, making a webserver is complex, and so is making a native UI. Of course you can use GHCJS, but at that point, why not just teach PureScript in the first place?

tome 3 years ago | | |

Or use gloss and make "something" arguably more easily than in PureScript?

https://hackage.haskell.org/package/gloss

iamnotsure 3 years ago |

Had good experience at https://exercism.org/tracks/haskell

I don't think this article is helpful for beginners.

burnished 3 years ago | |

I think this article's audience is teachers of beginners, not beginners themselves. At least the author is writing about their experience as a teacher.

Don't know why you thought it would be an article for beginners, but good on you for linking a resource regardless.

iamnotsure 3 years ago | | |

The article is an introduction to the basic concepts of Haskell, thus beginners may be considered a target audience. However, the style and the content brings to my mind the dreaded monad tutorials. I'm not convinced the article is about pedagogical downsides of specifically Haskell. It mostly reads like a collection of random purported gotchas/differences from someone with experience with other languages.

deafpolygon 3 years ago |

For someone not familiar with functional programming (but familiar with OOP/procedural), this was not easy or intuitive for me to follow.

FpUser 3 years ago |

I generally trying to avoid single paradigm languages that are trying to show me the one and only "true" way. I see no business benefits coming of of their use.