Idris, a language that will change the way you think about programming (2015)

Idris, a language that will change the way you think about programming (2015)(crufter.com)

118 points by kenshiro_o 10 years ago | 100 comments

bojo 10 years ago |

    app : Vect n a -> Vect m a -> Vect (n + m) a

That is pretty amazing if you ask me. I look forward to the day when we all use languages which save programmers from themselves.

vosper 10 years ago | |

As a Python programmer who doesn't understand this notation - what am I looking at, and what's amazing about it?

badsock 10 years ago | | |

The "Vector m a" means that it's a vector (like a Python list) that can only be m elements long, and those elements can only be of type a. E.g. "Vector 3 Int" will always be a Vector with three integers in it. If you try to treat it like a vector of any other length it will refuse to compile (e.g. pass it to a function that requires that it have 4 or more elements).

The whole line is the type declaration for a function (app) that takes a Vector of length m, and a Vector of length n, and produces a Vector of length (m+n). Which is to say, the compiler knows at compile time what length the resulting Vector will be, and will fail to compile if the function you've written doesn't - provably - always meet that requirement.

From a Haskeller's perspective it's amazing because container types are famous loopholes for code that produces runtime errors. For instance, if you call the "head" function (returns the first element) on an empty list it will halt the program at run time with an error, despite the fact that it passed the type check (because an empty list and a full one have the same type).

As someone who learned Haskell and subsequently have been writing a lot of Python, I keep a mental tally of how many of my bugs (some of which took ages to track down) would have been caught immediately by a type system like Haskell or Idris'. I'd say it's well over half.

In Haskell those kind of errors are drastically reduced, but a few still slip through. Languages like Idris can catch even more of them, in a clever way, which is awesome.

tunesmith 10 years ago | | |

You have two parameters - a vector of size "m", and a vector of size "n". The function signature then expects to get back a vector of size "m + n".

Contrast that with a language with method signatures that can only return types like "Vector" or "List", without any information that is dependent on the incoming parameters.

So then, the logic is checked - if the method returns a vector that is anything other than the size of the two incoming parameter vector sizes added together, it won't compile.

In other words, even more behavior that would normally be a runtime bug is checked at compile time, which is a good thing if you are working in a domain where correctness is important and want to avoid runtime bugs.

dllthomas 10 years ago | | |

One thing that no one touched on:

In reading Haskell (and apparently Idris) types, a name that starts with a lower case letter is an unbound type variable - sort of like a template parameter in C++. The `a` in each of the Vect's must be the same type but it can be any type (picked at the call site, for any given call).

eximius 10 years ago | | |

Basically the length of the vector is encoded at the type level. Lets you do all sorts of cool stuff.

You can have number systems modulo n, which are only compatible with other numbers modulo the same n because they are distinct types.

It's just another level of abstraction that pretty much nothing else has.

andrus 10 years ago | | |

    app : Vect n a -> Vect m a -> Vect (n + m) a

is a type signature for a function, `app`, which should append a vector (`Vect`) of length `n` to a vector of length `m`.

What's cool about Idris and other dependently-typed languages is that the length of the resulting vector, `n + m`, can be tracked in the type. This can prevent a whole slew of errors.

frankpf 10 years ago | | |

I'm not an expert in Idris, but I will try to answer this as best as I can.

That's the Haskell type notation (Idris and Haskell are similar). Here's an example:

  plus5 :: Int -> Int
  plus5 n = n + 5

The first line is the type declaration of the `plus5` function. You can read it as "plus5 takes an Int as argument and returns another Int". The second line is the function declaration. The equivalent in Python would be:

  def plus5(n):
      return n + 5

The syntax is the same when functions have more arguments:

  add :: Int -> Int -> Int
  add a b = a + b

This means that add takes two integers as arguments and returns another integer. This syntax might look strange (you could be asking yourself "How do I distinguish between the argument and return types?") but that's because in Haskell functions are curried[1].

Haskell (and Idris as well) also has parametric polymorphism, so you can define functions which operate on generic types. For example, the identity function:

  id :: a -> a
  id x = x

You can read this as "`id` is a function that takes an argument of type a and returns another argument of type a (so the return type and argument type must be the same)". This means the id function will work for any type of argument (String, Float, Int, etc.). Note that of course this is not dynamic typing, Haskell and Idris are statically typed and therefore all the types are checked at compile-time.

Now, let's move on to Idris. One of the reasons there's a lot of excitement over Idris is because it has dependent types, i.e., you can declare types that depend on values and these are checked at compile-time.

The example GP gave was this:

  app : Vect n a -> Vect m a -> Vect (n + m) a

`app` here stands for append. This is the function that appends two vectors (lists of static length) together. In Idris, `Vect x y` is a vector of size x and type y, so

   Vect 5 Int

is a vector of 5 integers. So, essentially, `app` is a function that takes two vectors as arguments, `Vect n a` and `Vect m a` (this means they can be of different sizes but their content must be of the same type). It returns a `Vect (n + m) a`.

Think about it for a minute. That is amazing! We can be sure that, at compile-time, just by looking at the function signature, our append function, given two vectors of size n and m, will _always_ return a vector of size (n + m).

If we accidentally implement a function that does adhere to the type specification (for example, it appends the first argument to itself, returning a Vect of size (n + n) instead of (n + m)), the code will not compile and the compiler will tell us where is the bug.

[1]: http://learnyouahaskell.com/higher-order-functions#curried-f...

bsznjyewgd 10 years ago | | |

A length n vector of a's, appended to by a length m vector of a's, results in a length (n + m) vector of a's. This is a safety guarantee that is statically checked at compile time rather than dynamically checked at run time.

corysama 10 years ago | | |

You're gonna have to read the article to find out.

tines 10 years ago | |

As a C++ programmer, I'd like to ask how is this different in effect from

    template<typename T, std::size_t X, std::size_t Y>
    std::array<T, X + Y> app(std::array<T, X>, std::array<T, Y>);

pcwalton 10 years ago | | |

Because with dependent typing, the length can be specified at runtime and you still get static checking.

There's also a huge difference between untyped templates and strongly typed generics, but in this particular case the difference is somewhat subtle.

Chattered 10 years ago | | |

I haven't coded in C++ for a very long time, but won't any implementation of that function, and additional templated functions which call it, only get checked at template expansion time? As I understand, buggy templated functions and classes can go unnoticed in a codebase or library simply because no-one has instantiated them in a way which exposes the bug. This is definitely not the case for Idris.

My understanding was that concepts began as an attempt to fix this "problem".

dllthomas 10 years ago | | |

My recollection of C++ (possibly outdated since 11 or 14, though that would - pleasantly - surprise me) is that template parameters must be picked at compile time. With Idris, you can pick X and Y at runtime and still get compile time checking.

jonahx 10 years ago | | |

operationally, i'm not sure if there is. it's certainly a lot prettier, though. i'd be curious if there's a deeper difference, too. it feels like there is, at least to me...

solomatov 10 years ago | |

Actually it's not that easy. Everywhere you have a type you need to proof that the expression really has the right type, and it's not just type annotation, it's a real mathematical proof.

To understand the complexity of the task, try proving that insertion sort is really a sort algorithm. You write more code, and writing it will take more time than you would do in a 'normal' typed language.

serverholic 10 years ago | | |

You can choose how rigorous you want your proof to be.

I'd highly recommend reading Chapter 1, Section 1.3 from Type-Driven Development. The first chapter is free.

https://www.manning.com/books/type-driven-development-with-i...

rfw 10 years ago | |

Unfortunately, the usefulness is limited – the size of all Vects must be known at compile time. While this does prevent a large class of bugs that arise from incorrect compile-time knowledge, the size of the class of runtime bugs affected by this is a lot smaller, as for each differently-lengthed Vect, a distinct instantiation of the Vect type needs to exist at compile-time.

dllthomas 10 years ago | | |

Per my understanding, what's so exciting about this is precisely that you're wrong - it does not need to be known at compile time, but can prove symbolically that the resulting length will be X + Y. And of course, given that, it's not statically creating a distinct instantiation of the Vect type for every N at compile time!

orblivion 10 years ago | | |

I've never actually used such a feature, but my understanding is that this is not strictly true. I think that the compiler can generically understand that this function adds two vectors and their lengths.

I'll see if I can contrive an example. Supposing you have a list L1 of vectors of length 5. You also have a vector V1 with user input, with length x (unknown at compile time). You then make a new list L2 by taking each vector in L1 and append V1 to it. L2 is now a list of vectors of length (5 + x). Even though x is unknown at compile time, the compiler still knows that all vectors in L2 are of the same length. It can make restrictions based on this fact.

It seemed strange to me when I heard this concept, I thought the compiler would need to consider infinite possibilities, but apparently similar things are possible even in Haskell. For instance, you can define a list as a recursive type that holds a value of a Null type (not just a null value of the list type) or another List. Apparently it can still reason about this.

However, these are things I've heard. Hopefully someone with more experience can chime in and let me know if I'm right here.

kmicklas 10 years ago | | |

Not quite. Types can depend on runtime values. This can in effect _force_ the programmer to perform the required validation when accepting foreign data.

dllthomas 10 years ago | |

Working in Haskell, I don't feel like it's "saving me from myself" so much as giving me tools I can use to save myself.

solomatov 10 years ago |

If you are interested in Idris, you might also be interested in agda, which is another dependently typed programming language: https://en.wikipedia.org/wiki/Agda_(programming_language)

nv-vn 10 years ago | |

Other interesting ones to look at are F* (a joint effort from Microsoft Research and INRIA to create an ML-like dependently typed language) and ATS (a low-level, fast dependently typed language meant to replace C). What's cool about both of these is that they put emphasis on allowing imperative programming despite being dependently typed and seeming very functional.

brudgers 10 years ago |

Idris homepage: http://www.idris-lang.org/

Github: https://github.com/idris-hackers

nihils 10 years ago |

This is actually perfect for a library on algebraic structures I've been trying to make in Haskell. For example, how does one distinguish between elements in the Dihedral group of order 10 vs. Dihedral group on order 16 when obstensibly, they have the same representation. For now, I think Haskell programmers use type-level arithmetic libraries, but this is a much better solution.

tikhonj 10 years ago | |

Haskell is actively moving in the direction of adding dependent types too. I believe phase 1[1] of the plan[2] is slated for GHC 8.0 (the upcoming release), and I'm sure the rest of it will follow soon.

It's pretty exciting!

[1]: https://ghc.haskell.org/trac/ghc/wiki/DependentHaskell/Phase...

[2]: https://ghc.haskell.org/trac/ghc/wiki/DependentHaskell

athesyn 10 years ago |

Bit disappointed it makes the assumption the reader knows Haskell, that lost me immediately.

skosch 10 years ago | |

You know what, I've found it immensely valuable to get acquainted with Haskell, even though I've never used it (and likely never will). The concepts are timelessly beautiful, simple to understand, and can feel enlightening to run-of-the-mill imperative/OOP programmers.

What's more, it seems to me that Haskell syntax is the lingua franca when discussing anything related to data types and functional programming these days. Those ->'s are everywhere, it's useful to know what they stand for.

Just skim a few chapters of learnyouahaskell.com; you won't regret it.

enraged_camel 10 years ago | | |

I'm more of a "learn by building things" type of learner. What kinds of things can I build with Haskell?

For example, I got into Ruby via Rails, because Rails lets you quickly prototype simple web apps. So I could go from "I wish I had an app that does X" to actually building it, deploying it and sharing it with others. What would a similar "learning flow" look like in Haskell? (doesn't have to be web-based)

Put another way, when I come across a problem, how do I recognize it as the type of problem that is best solved using Haskell, vs. an imperative language?

blt 10 years ago |

I recently had some murky thoughts on my ideal Matlab replacement, and it would have a feature like this. It would be huge for array-oriented programming:

    func train_model(X: [n d], y: [n 1])

So many lines of code are spent verifying that the sizes of two function arguments are compatible.

shele 10 years ago | |

Other example: Fixed size matrix multiplication in https://github.com/SimonDanisch/FixedSizeArrays.jl depends on element type T and dimensions MxN and NxR:

    function *{T, M, N, R}(a::Mat{M, N, T}, b::Mat{N, R, T})

jdminhbg 10 years ago | |

You can do this in Julia now, with many of the affordances you'd expect from Matlab.

    function train_model(X::Array{Float64,d}, y::Array{Float64,1})

MichaelBurge 10 years ago |

I haven't used Idris, but it sounds like someone I might want. I really want a tool that lets me mix and match operational code with proofs. It's common now to write a unit test while debugging something, less common to write a randomized property tester, and rare to see a theorem prover being used.

It would be fantastic if I could casually throw in a proof while debugging a hard problem.

farhanhubble 10 years ago |

Thanks for the link. I was looking into liquid Haskell and refinement types and thought that it's a great idea but rued the fact that it wasn't built into the language. I am definitely going to try Idris for one of my projects.

muhuk 10 years ago |

But does it allow specifying vectors that has no less than three and no more than seven items? Or vectors with even number of items?

More importantly can we possibly implement church numerals in Haskell? /rhetorical

dllthomas 10 years ago | |

"But does it allow specifying vectors that has no less than three and no more than seven items? Or vectors with even number of items?"

Yes.

framp 10 years ago |

Great article and really interesting language!

I wonder how Idris is going to be affected when Dependent types come to Haskell as well (announced at last ICFP)

jonsterling 10 years ago | |

it's nice for the haskell folks that they'll be getting some sort of dependent types, but suffice it to say that they will be of a very different sort than the Idris ones.

Not to mention, there is hope of giving a semantics to Idris since it is based on a fairly routine variant of type theory, whereas I don't think there is any hope at all of understanding "which" type theory the Haskell folks shall have implemented.

lambdasquirrel 10 years ago | |

I think it will be a while, if ever, before we're able to handle passing types around as values, in Haskell, the way we do in Idris and Agda. Dependent typing isn't just some feature. It touches a whole other way of thinking about things.

If you go all the way with Agda, you have significantly constrained recursion to the point that the language isn't turing-complete anymore, but in return you get termination checking and other nice things. As it turns out, most of the code we write doesn't need unbounded recursion. Seriously, can you think of the last time you wrote something like that?

And I think that'll be a hard pill for people to swallow, much like how it was really hard to sell memory safety to C/C++ guys back in the day.

vzaliva 10 years ago |

I would suggest to use Python or some other mainstream language instead of Haskell in example to contrast to Idris. People who know Haskell most likely be already familiar with dependent types and for people not familiar with Haskell syntax could be confusing (as some comments indicate).

cantankerous 10 years ago | |

Many mainstream languages like Python don't even have a syntactic notion of types. The best you could really do is say that Idris would get you some typing reification that you might do in those languages at no cost, but it would seriously sell short the power of dependent types like you see in Idris.

Ace17 10 years ago |

I don't get it. Is it just about rediscovering std::array?

dllthomas 10 years ago | |

Can you append a std::array<int, x> and a std::array<int, y> to get a std::array<int, x+y> with x and y chosen at runtime?

Ace17 10 years ago | | |

Of course not. In my understanding, the article was about static type checking, though.

jonathonf 10 years ago |

Does anyone know whether Idris is a response to Haskell's 'Foldable' controversy?

It looks like it was released in 2012 on hackage but I don't know how far back 'Foldable' was envisioned.

tome 10 years ago | |

It is not.

ophelia 10 years ago |

Is Idris related to Idris Elba, the British actor, by any chance?

andolanra 10 years ago | |

The story, if I recall correctly, is that the programmer Edwin Brady had a previous project that was a proof engine, and lacking a suitable name for it, named after an older British children's cartoon called Ivor the Engine.[^1] When it came time to name a newer project, he decided to name it after another character from the same show: Idris, the little red Welsh dragon. This, incidentally, is why the Idris language's logo is a stylized red wing.

[^1]: Ivor the Proof Engine is here, although apparently not actively updated any more: https://github.com/edwinb/Ivor

archgoon 10 years ago | | |

Thank you, I had read the FAQ:

http://docs.idris-lang.org/en/latest/faq/faq.html

but could not infer why a dragon from Ivor the engine had been chosen. That the author had previously written a proof engine makes all the sense now. :)

jonathonf 10 years ago | |

Because an existing name (from all various of the world) wasn't around before some bloke on the telly?

systems 10 years ago |

"Indentation significant syntax"

in my opinion, one of the worst ideas to plague many new languages

brackets are really, seriously, honestly a better visual cue for grouping

int func() { while dosomething() { dosomething() dosomething() doanotherthing() } dosomething() } def func(): while dosomething(): dosomething() dosomething() doanotherthing() dosomething()