Little is a statically typed, C-like scripting language

Little is a statically typed, C-like scripting language(little-lang.org)

192 points by Thedarkb 5 years ago | 129 comments

asicsp 5 years ago |

http://www.little-lang.org/why.html is pretty interesting

>We (BitKeeper folks) did our GUI interfaces in Tcl/Tk years ago because it meant we could have one gui person and get the same gui tools on Windows, Mac, Unix, and Linux.

>While some of us could switch from C to Tcl easily, our pointy-haired boss could not, he's mostly C and would lose about a half a day to get back into Tcl.

>Success was realized when one of our engineers, who is not a Little fan, fixed a bug in a patch that flew by in email without realizing it was Little instead of C

luckydude 5 years ago |

Wow, just noticed this. I'm the guy who paid for Little, a bunch of other people did all the work.

I'm surprised to see it getting some attention but happily so. Little is what I'd like C to evolve towards, there is a lot of useful (to me) stuff in the language.

I'll wander through the comments and reply where I can.

luckydude 5 years ago | |

I just realized I didn't give credit to all the people who worked on Little. So here goes. Tim Daly did the first pass. Oscar Bonilla stepped up, I still remember him saying to Tim "we need an AST" and Tim said I can do this. We needed an AST, Oscar was right about that and a lot of other things. Rob Netzer, my roommate from college and former Brown tenured prof, he did the most heavy lifting in the Little compiler. Damon Courtney was our GUI guy, he had huge influence in Little.

Jeff Hobbs from the tcl community helped as well. I have pictures of that group of people. Jeff helped a lot, he wanted this, I could say why but I don't want to speak for him.

Little is what I'd like C to be but those guys made it happen.

I'm amazed that Little got some traction, happy that it did for a moment, those guys deserve all the credit, I was a whiny dude wanted a more C like thing and they gave it to me.

marktangotango 5 years ago | |

I'm curious if you've written about the decisions around licensing that essentially killed the bitkeeper business by inspiring Linus to create git? What are your thoughts around that today?

luckydude 5 years ago | | |

Hind sight is 20-20. The BitKeeper business had a good run, we were around for 18 years. It made enough that I and my business guy are retired off of what we made.

On the other hand, we didn't make enough for everyone to retire if they wanted to. We had a github like offering and it's pretty clear that we should have put a bunch of money into that and open sourced BitKeeper.

All I can say is it is incredibly hard to make that choice when you have something that is paying the bills. I tried to get Sun to do it with the BSD based SunOS and they wouldn't. And even though I had that vision for Sun, when it was my livelihood, I couldn't see the path to doing so.

Shoulda, coulda, woulda, my biggest regret is not money, it is that Git is such an awful excuse for an SCM. It drives me nuts that the model is a tarball server. Even Linus has admitted to me that it's a crappy design. It does what he wants, but what he wants is not what the world should want.

It says a lot that we have a bk fast-export and we can incrementally run that and get idempotent results. As in go from BK to Git on an ongoing basis, have two people do it in parallel and they both get bit for bit identical results. If you try and go the other way, Git -> BK, if you do it in parallel you get different results because Git doesn't store enough information, so BK has to make up the missing bits.

Git has no file create|delete|rename history, it just guesses. That's my biggest regret, I wish Linus had copied that part.

luckydude 5 years ago | | |

BTW, Linus switched to Git 2005. BitKeeper didn't give up and turn to open source until 2016. So we had a 10 year run after Git showed up where people still paid us.

It was good run, not many software companies get an 18 year run. I'm fine with it, would have liked to do more for my people.

Though, when we were shutting things down and I was bumming that I had not gotten retirement money for all of my people, one of them said something like "Dude, are you kidding me? My best friend barely knows his kids, he is out the door by 7am to fight Houston traffic and not home until close to 7pm. My commute is from my bedroom to my office down the hall. I've got to help my wife with the kids, I see the kids all day every day, my life is infinitely better than my friend's life and you gave that to me. You're fine."

I'm a wimp, I teared up a bit, that was the nicest thing he ever said to me. It's not just about money.

rdpintqogeogsaa 5 years ago | |

I just wanted to say thank you for believing in the SCCS weave when making BitKeeper. It is an incredibly elegant design (though I've failed to properly implement it myself yet).

luckydude 5 years ago | | |

The SCCS weave was our secret sauce. Tichy did the world a huge disservice in his PhD about RCS (like that was worth a PhD, come on) where he bad mouthed SCCS's weave (without understanding it, or maybe he was spreading misinformation on purpose, he implied that SCCS was forward deltas where RCS is backwards deltas, see below for what that means).

When we were still in business, each new SCM that came out, we'd hold our breath until we looked at it and said "No weave!"

For those who don't know, the SCCS weave is how your data is stored. Most people are used to something like RCS which is patch based. For the RCS trunk, the head is stored as plain text, the previous delta is a reverse patch against the head, lather, rinse, repeat. Branches are forward deltas, so if you want to get something on a branch, you start with the head, apply reverse deltas until you get to the branch point and then forward deltas until you get to the branch tip. Ask Dave Miller how much he loved working on a branch of gcc, spoiler, he hated it. With good reason.

SCCS has a weave that is not aware of branches at all, it only knows about deltas. So you can get 1.1 in exactly the same amount of time as it takes to get head, it is one read through all the data. bk annotate (git blame) is astonishingly fast.

And merges are by reference, no data is copied across a merge. Try that in a patch based system. How many of you have been burned because you merged in a branch full of code that someone else wrote, and on the trunk all the new data from the branch looks like you wrote it, so you get blamed when there is a bug in that code? That's because your brain dead system copied code from the branch to the trunk (Git does this as well, that's what the repack code is trying to "fix", it is deduping the copies).

Weaves are the schnizzle, any SCM system that doesn't use them today is so 1980.

amacbride 5 years ago | |

Avocet (CodeManager) ruined me for other source control systems for years and years — I still look back on it fondly.

luckydude 5 years ago | | |

That was me as well, though I wrote it in perl4 and C. My version was called NSElite. The Solaris kernel was developed under NSElite, some stuff is here:

http://mcvoy.com/lm/nselite

2000.txt documents the first 2000 resyncs (think bk pull) of the kernel.

Avocet was what you got when you took all my perl code and handed it to the tools group and they rewrote it in C++ (which they later admitted was a horrible idea). The only thing of mine that they kept was smoosh.c and that was because not a single one of them had the chops to write that code (yeah, there was no love lost between me and the tools group).

BitKeeper is what Avocet could have been if Sun had not stopped me from doing any more work on NSElite (I was 1 guy who was coding circles around 8 tools people and they didn't like it). Shrug. C++ was just wrong, perl4 was just way faster to code in, and when I needed performance I coded in C. It's not my fault they picked the wrong way to go about things. (That, BTW, was the first time I ever personally saw that you really can have one guy who can do the work of 8, almost, but not quite a 10x programmer :-)

fuball63 5 years ago |

Not to be confuse with lil, which is a small scripting language based on TCL: http://runtimeterror.com/tech/lil/

I think this is a super interesting project. It reminds me what Groovy is to Java, but backwards. Groovy is a “looser” version of Java that compiles Java. Little is a “stricter” version of TCL that compiles TCL.

narrator 5 years ago | |

TCL had some bad features that kind of killed it. For example, "upvar" was a really bad idea. Bad features tend to kill languages over time. Everyone used to use Perl in the late 90s. It had too many bad features though, and nobody wanted to maintain those programs.

derefr 5 years ago | | |

I've always been surprised that nobody has tried to take a "The Good Parts" subset of a big language (C++, Perl, etc.), codified/formalized it as its own language, and then attempted to popularize the new reduced language as a distinct effort/project/community to that of the original language.

One could release this "language" as a distribution of the inner language's compiler together with a wrapper (like C++ originally was to C), that, rather than adding features and compiling down, just analyzes the source file and errors out on use of forbidden syntax; or, if no forbidden syntax is used, just passes your code straight through to the inner compiler. A bit like a pre-commit-hook style checker, but a pre-compile-hook style checker.

nrclark 5 years ago | | |

I always thought upvar was kind of a cool feature, and allowed laser-specific targeting in places where you'd use a global otherwise in C. Is there any specific way that it's caused problems?

isr 5 years ago | | |

Hmm, I would disagree with this (quite strongly). Not that tcl doesn't have misfeatures (of course it does), but that `upvar` or `uplevel` are one of them.

Those commands effectively make any tcl function be able to operate as an f-expr (in old lisp parlance). Effectively (this is a simplification), an fexpr is a runtime macro, as opposed to the more traditional lispy compiletime macro.

Its what makes tcl feel more like a lisp-for-strings. Much of the truly horrible tcl code out there in the wild is from folks who try to use tcl as just another c-style scripting language.

Treat it more as a lisp, and it some of its inherent elegance shines through.

My $0.02 anyway ...

csande17 5 years ago | | |

What's upvar?

BruceEel 5 years ago |

   Compiles to Tcl byte codes

Interesting, I didn't even know there was such a thing. Good to see <> and =~ live on.

simias 5 years ago | |

=~ is nice and convenient but I really don't think <> was worth bringing back. I'm sure people who never used Perl could figure out what `buf =~ /${regexp}/` does, but I wonder if they'd be able to figure out the `while (buf = <>)`.

Perl has a lot of good ideas and things I find myself missing a lot when I use Python or JS (autovivification being probably #1) but IMO these one or two symbol magic variables would generally be improved if they were more descriptive, with maybe one exception for $_/@_.

Although I must admit reminiscing about this made me realize how much I miss Perl now that I'm forced to use Python for work.

jiofih 5 years ago | | |

Autovivification was one of the most painful features I’ve had to live with - in a large codebase it completely erases trust on any kind of defined() check and breaks all sorts of things unexpectedly.

Yet another horrible hack in Perl that for some reason is advocated for. Optional chaining / null propagation is a much, much better idea and shouldn’t have been any harder to implement.

luckydude 5 years ago | | |

I wrote most of my first source management system (NSElite, mentioned elsewhere in this thread) in perl4. I was learning perl at the time and my first and second efforts were awful. Perl really lets you get sloppy and create unmaintainable code.

My 3rd rewrite was very stylized and, I felt, maintainable. Which proved to be true as I had to fix bugs in it.

I did weird stuff like using $whatever as the index into the @whatever array.

But I digress. On the <>, Little has argv so you can do

int main(string argv[]) { int i; string buf; FILE f;

    if (defined(argv[1]) && streq(argv[1], "-") && !defined(argv[2])) {
        while (buf = <STDIN>) bputs(buf);
    } else {
        for (i = 1; defined(argv[i]); i++) {
            if (defined(f = fopen(argv[i], "r")) {
                while (buf = <f>) puts(buf);
                fclose(f);
            } else {
                 fprintf(stderr, "unable to open '%s'\n", argv[i]);
            }
        }
    }
    return (0);

}

but why would you want to when all of that is

int main(string argv[]) { string buf;

    while (buf = <>) puts(buf);
    return (0);

}

I mean, come on, that's cat(1) in 8 lines of code.

edit: I need to learn hacker markup. My code looks like crap.

nightowl_games 5 years ago |

Curious on performance benchmarks. For me, I'm comparing this to wren:

https://wren.io/

luckydude 5 years ago | |

There are some benchmarks here:

http://mcvoy.com/lm/L/tcl/tests/langbench/

some results in the README. Perl holds up well. These are probably 10 years old though.

tyingq 5 years ago | |

Tcl is historically very slow. Especially for synthetic CPU intensive benchmarks. However, since it's so easy to interop with C, it didn't seem to matter much in the real world. You just put anything performance sensitive in C and left the bits that didn't matter in tcl. Some benchmarks: https://github.com/trizen/language-benchmarks

Thedarkb 5 years ago | | |

It takes Little 3.53 seconds to find the 33rd number in the Fibonacci sequence recursively versus 30.23 seconds for Tcl8.6 on my i5-3320M.

rkeene2 5 years ago | | |

Additionally, you can compile Tcl to machine code with TclQuadCode for up to a 66x speed improvement

Tade0 5 years ago |

I originally read it as "Life is..." and thought "that's an interesting take on things".

slazaro 5 years ago | |

Make it a T-shirt, the fake-deep tone for a nonsensical statement is kind of ironic/cool.

Tade0 5 years ago | | |

I think there's a very real possibility that trying to do that I would dissolve from the embarrassment like the witch in The Wizard of Oz.

0xbadcafebee 5 years ago |

Why are there so many different languages that are almost the same except for one or two attributes? Why not make one language that can do everything?

If you can make one language strongly typed, and one weakly typed, then you should be able to build one language which can do either/or, depending on a compiler flag. Then you simply decide before you start writing your code whether you want to write it weakly typed or strongly typed, and pass the correct compiler option.

Take that same idea, but add in every language's quirks, and just enable/disable them. Then we wouldn't need to constantly reinvent languages, because we'd have one that can do everything.

Otherwise we're going to keep re-writing the same damn thing for hundreds of years, and that just seems like such a pointless waste of effort.

arunix 5 years ago | |

I think that's what Larry Wall was trying to do with Perl6/Raku:

https://thenewstack.io/larry-walls-quest-100-year-programmin...

lizmat 5 years ago | |

Have you looked at the Raku Programming Language? https://raku.org using the #rakulang tag on social media.

tyingq 5 years ago |

The repo is odd, it's hard to tell where the actual little-lang code is. I guess it's in the tclXXX directory?

https://github.com/bitkeeper-scm/little-lang

rbsmith 5 years ago | |

All the files that begin with L in :

https://github.com/bitkeeper-scm/tcl/tree/master/generic

mannykannot 5 years ago |

This language clearly avoids some of the run-time errors that can occur with C, but I would like to learn a little more about the remainder. For example, if you make an out-of-bounds assignment to an array, the array is grown to accommodate it (for +ve offset only, I assume) - but what about out-of-bounds retrieval?

luckydude 5 years ago | |

You get undef, that's part of the reason we added an undef concept (it's a value that isn't a value, though if you treat it like an int I believe it is zero, like a string and you get "").

CyberDildonics 5 years ago | | |

I think that is a giant design mistake since you are pushing the discovery of an error to some place else in the program, potentially very far from where the error occurred.

IshKebab 5 years ago | | |

So there's `undef` and `null`? I wonder if there's a popular language that made the same mistake you could have learnt from :-P

CyberDildonics 5 years ago | |

What would you want a language to do if you try to access an array offset that doesn't exist except for give you a clear error?

luckydude 5 years ago | | |

You are kind of making my point. In tcl, they'd just give you "", but you can't tell the difference between "past the end of the array" or an element where you said

set foo[i] = ""

In Little you can tell, we'll return undef (your clear "error" though in these languages it is a supported feature, not an error). So we support the auto expanding array but give you that extra bit of info that you are past the end.

mannykannot 5 years ago | | |

Well, it's that line of thought that prompted my question. It is, of course, a well-known source of problems in C that it will uncomplainingly dereference an invalid pointer.

nanofortnight 5 years ago |

Is Little embeddable? This seems like a perfect scripting language for embedding into a larger C application.

luckydude 5 years ago | |

I would think so but I haven't done it. People embed tcl all the time, perl/tk is perl with a tcl interpreter embedded just so they can get at the tk part (gui stuff).

Thedarkb 5 years ago | |

I asked Oscar Bonilla on Twitter a while ago and he said that it should be the same process as Tcl, but beware I haven't tried.

maskedoffender 5 years ago |

Bellard's tiny C compiler (tcc) can execute C so fast it's as if it were a scripting language.

https://bellard.org/tcc/

swagonomixxx 5 years ago |

> undef(argv[1]); // left shift down the args

Can someone explain what this does? Is this some Perl or Tcl thing? Unfortunately I've never used either :)

luckydude 5 years ago | |

I'll grant you it is sort of weird. I think that's a perl thing, we just copied how they did it.

undef is both a function and a (non) value. It is the main reason Little never got pushed back into tcl, the tcl crowd hates the idea that there can be a value for a variable that is undefined. I found that very useful, for example, undef is the error return from any function. Just made sense to me, didn't make sense to the Tcl people.

tyingq 5 years ago | |

It's not a Perl thing. In Perl, that would set argv[1] to undef. It would not delete or left-shift @ARGV. There is a delete() function that acts similarly, but is discouraged to use on regular arrays. Shift() would be more appropriate in this case.

Given the context, in little-lang, it appears to delete argv[1] and shift all of the right of that down, such that argv[2] becomes argv[1] and so on. That's so that the the "while (buf = <>)" construct used right below it doesn't process the regex as if it were a file to "grep" through.

In Perl, you would typically do it this way...

  if (!defined(my $regex=shift(@ARGV))) {
      die("usage: grep regexp [files]");
  }

synergy20 5 years ago |

so this is a c-like-interface for tcl/tk libraries? I was thinking it's a c-style lua script alternative so I can use it on embedded boards.

sneak 5 years ago |

I support any language that lets me use unless (if!) and until (while!).

I really wish more languages would adopt this syntactic sugar.

peteretep 5 years ago |

Strongly recommend removing the Perl camel from the logo as it’s a trademark owned by a commercial entity

luckydude 5 years ago | |

Who owns it? OReilly?

peteretep 5 years ago | | |

Yep

rurban 5 years ago |

I really like the syntax. But it should be compiled to lua, not tcl.

luckydude 5 years ago | |

Well the compiler is open source, have at it :-)

Personally, I would love a gcc --little dialect complete with a String type (and others) that is garbage collected and auto resized just like tcl/Little. With all the other Little goodness in there. Man, that would make C super pleasant. And it wouldn't be a new syntax like Go/Rust/whatever.

dilawar 5 years ago |

is there a list of acomputer languagesn(dead or alive) and spoken languages (dead or alive)? Would like to see at what time computer languages are likely to outnumber spoken language.

rvense 5 years ago | |

The highest estimate for number of natural languages I've seen is about 7,000 living. But I think you'd have to have a very, very restrictive definition of programming language for it to be lower than that.

endergen 5 years ago | | |

Definitely depends on the definition, it seems every programmer and their mother has some toy programming language they poke at.

zabzonk 5 years ago | |

Unsurprisingly, Wikipedia has both.

forgotmypw17 5 years ago |

this is amazing, like a dream come true!

vram22 5 years ago |

Has any one else noticed that programming-language-topic threads on HN seem to come in batches, sometimes? I'm not complaining. I like it, since I am a language fan, though not an implementer or lawyer. I have seen this phenomenon at least a few times in the last few years. (Did not check much during Covid.)

JNRowe 5 years ago | |

I've been enjoying this series of "Breakfast With Forth Week", best one yet.

It had left me wondering what had caused it. There was a thread a few weeks ago where the old guard were describing an actual attempt to game the system, and I wondered if we were seeing a version of that being played out.

https://news.ycombinator.com/item?id=25787374

vram22 5 years ago | | |

Will check,thanks.

sidpatil 5 years ago | |

It's probably because one link leads to another, and interest in the topic is piqued for a while, until the next new (or old) cool topic comes along.

vram22 5 years ago | | |

Yes, that's probably part of the reason - apart from a fad-of-the-month kind of thing, and also due to genuine interest in the topics.

mastrsushi 5 years ago |

This sort of reminds me of the old MUD VM language Pike https://pike.lysator.liu.se/

luckydude 5 years ago | |

I remember Pike, I looked at it. It was too far away from C for me. I'm a died in the wool C programmer (I started as a kernel programmer and formed some strong opinions there). I get that C is not for everyone but for me, it's enough of a language to do what I want and not filled with this, that, and the other kitchen sink.

So Little looks a lot more like C than Pike does. And I like it that way. It's not for everyone but C programmers will probably like it.