Julia 1.6 Highlights

410 points by mbauman 5 years ago | 214 comments

Buttons840 5 years ago |

I recently ported a reinforcement learning algorithm from PyTorch to Julia. I did my best to keep the implementations the same, with the same hyperparameters, network sizes, etc. I think I did a pretty good job because the learning performance was similar: both solved the CartPole environment in a similar number of steps, etc.

The Julia implementation ended up being about 2 to 3 times faster. I timed the core learning loops, the network evaluations and gradient calculations and applications, and PyTorch and Julia performed similarly here. So it wasn't that Julia was faster at learning. Instead it was all the in-between: all the "bookkeeping" done in Python ended up being much faster in Julia, enough so that overall it was 2 to 3 times faster.

(I was training on a CPU though. Things may be different if you're using a GPU, I don't know.)

gdpr 5 years ago | |

Similar experience over here. (G)ARCH models are severely underserved in Python, and I could not be bothered to learn a probabilistic programming abstraction like Pyro or Stan just to build a quick prototype myself.

Chose Julia instead. Took 4 hours to get everything sorted out (including getting IT to allow Julia's package manager to actually download stuff) and have the first model running, just putting a paper into code. Since the code is just writing the math, this is a vast communication improvement.

After fiddling around with it at home for a week, this was the first professional experience and I'm blown away.

wiz21c 5 years ago | |

Could you tell us more? It looks like a very in-depth / interesting benchmark.

Buttons840 5 years ago | | |

I will make a blog post about it.

stellalo 5 years ago | |

That’s interesting: did you use Flux?

Buttons840 5 years ago | | |

Yes. I used Flux.

beeforpork 5 years ago |

Julia is such a wonderful language. There are many design decisions that I like, but most importantly to me, its ingenious idea of combining multiple dispatch with JIT compilation still leaves me in awe. It is such an elegant solution to achieving efficient multiple dispatch.

Thanks to everyone who is working on this language!

chalst 5 years ago | |

Julia is the first language to really show that multiple dispatch can be efficient in performance-critical code, but I'm not really sure why: JIT concepts were certainly familiar to implementors of Common Lisp and Dylan.

chalst 5 years ago | | |

I guess the reason is that Julia's type system and standard libraries really guide users to use types that the JIT can unbox as far as possible.

skohan 5 years ago | |

What does it mean exactly? Or what is novel here?

socialdemocrat 5 years ago | | |

The combination. E.g., multiple dispatch without JIT would be really slow, as you are picking a method to run at runtime based on the types of all the function arguments.

That requires a linear search through a list of all possible combinations of input arguments.

In a single dispatch language like most object oriented languages, you can do a simple dictionary/hash table lookup. Much faster.

With the JIT, Julia is able to optimize away most of these super slow lookups at runtime. Hence you get multiple dispatch for all functions but with fantastic performance. Nobody had done that before.
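
A minimal sketch of what "picking a method based on the types of all the arguments" looks like. The `Shape` types and the `collide` and `pair_kind` names are made up for illustration and are not from the thread. When `pair_kind` is compiled for concrete argument types, the compiler already knows which `collide` method applies, so no lookup is left for runtime.

```julia
# Hypothetical types, for illustration only.
abstract type Shape end

struct Circle <: Shape
    r::Float64
end

struct Square <: Shape
    side::Float64
end

# One generic function with several methods. The method is selected from the
# types of *both* arguments, not just the first one.
collide(a::Circle, b::Circle) = "circle-circle"
collide(a::Circle, b::Square) = "circle-square"
collide(a::Square, b::Circle) = "square-circle"
collide(a::Shape,  b::Shape)  = "generic fallback"

# pair_kind is specialized per combination of concrete argument types, so the
# call to collide inside it can be resolved at compile time.
pair_kind(a, b) = collide(a, b)

results = [pair_kind(Circle(1.0), Square(2.0)),
           pair_kind(Square(2.0), Square(2.0))]
```

The second call has no dedicated `Square`/`Square` method, so it falls through to the `Shape`/`Shape` method.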

ddragon 5 years ago | | |

This video does a good job of explaining the idea behind multiple dispatch in Julia, if you have time:

https://www.youtube.com/watch?v=kc9HwsxE1OY

JulianMorrison 5 years ago | | |

It's like C++ template specialisation, but it happens when the compiler realises you need a particular version. Which may be at runtime, if you changed something.

pjmlp 5 years ago | |

I advise you to check Common Lisp CLOS and Dylan.

eigenspace 5 years ago | | |

What the OP is talking about is julia's method-based JIT strategy coupling very well to multiple dispatch.

JIT is not new, multiple dispatch is not new, and multiple dispatch + JIT also isn't new, but no existing languages combined them in a way that allows for the fantastic, efficient devirtualization of generic methods that julia is so good at.

This is why things like addition and multiplication are not generic functions in Common Lisp, it's too slow in CL because the CLOS is not able to efficiently devirtualize the dispatch. In julia, everything is a generic function, and we use this fact to great effect.
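
To make "everything is a generic function" concrete, here is a small sketch. The `Money` type is hypothetical and only for illustration. `+` is an ordinary generic function, so adding a method for a new type is enough for existing generic code such as `sum` to work with it.

```julia
# Hypothetical user-defined type, for illustration only.
struct Money
    cents::Int
end

# `+` is a generic function like any other; extend it with a new method.
Base.:+(a::Money, b::Money) = Money(a.cents + b.cents)

# `sum` was written long before `Money` existed, but it only needs `+`.
total = sum([Money(150), Money(275), Money(75)])

# `+` already carries many methods (integers, floats, arrays, ...).
has_many_methods = length(methods(+)) > 1
```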

CLOS and Dylan laid a ton of important groundwork for these developments, but they're also not the same.

ddragon 5 years ago | | |

Languages with multiple dispatch aren't rare, but a language having it as the core language paradigm, combined with a compiler capable of completely resolving the method calls during compile time, and therefore able to remove all runtime costs of the dispatch, and a community that fully embraced the idea of creating composable ecosystems is something unique to Julia. I don't think anyone has scaled multiple dispatch to the level of Julia's ecosystem before.

dan-robertson 5 years ago | | |

Common Lisp’s type system is just not really as useful for this sort of thing. In particular it doesn’t have parameterised types, so you can’t make, e.g., a matrix of complex numbers. This breaks (1) a lot of the opportunity for optimisation by inlining (because you can’t assume that all the multiplications in your matrix{float} multiplication are regular float multiplications) or generic code (because you can’t have a generic matrix type and need a special float-matrix); and (2) opportunities for saving memory with generic data structures, because the types must be associated with the smallest units and not the container (e.g. every object in a float matrix must be tagged with the fact that it is a float, because in theory you could put a complex number in there and then you’d need to know to do a different multiplication operation).

I guess you could try to hack together some kind of templating feature to make new type-specific classes on the fly, but this won’t work well with subtyping. Your template system could probably have (matrix float) as a subclass of matrix, but not of (matrix real) or (matrix number). I think you’d lose too much in Common Lisp’s hodge-podge type system.

A big innovation of Julia was figuring out how to make generic functions and multiple dispatch work in a good way with the kind of generic data structures you need for good performance. And this was not a trivial problem at all. Julia’s system lets you write generic numeric matrix code while still having float matrix multiplication done by LAPACK, which seems desirable.
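
A rough sketch of the point about parameterised containers. The `scaled_product` function is a made-up name for illustration. The element type is part of the matrix type, so one generic definition covers both cases and the elements themselves carry no per-element tag.

```julia
# Written once against any numeric element type T.
function scaled_product(A::AbstractMatrix{T}, B::AbstractMatrix{T}, s::T) where {T<:Number}
    return s * (A * B)
end

Af = [1.0 2.0; 3.0 4.0]                   # Matrix{Float64}
Ac = Complex{Float64}[1+2im 0; 0 1-1im]   # Matrix{Complex{Float64}}

Pf = scaled_product(Af, Af, 2.0)
Pc = scaled_product(Ac, Ac, 1.0 + 0.0im)

# The data is stored unboxed: 4 Float64 values take 4 * 8 bytes, and
# 4 complex values take 4 * 16 bytes.
bytes_f = sizeof(Af)
bytes_c = sizeof(Ac)
```

Each call gets its own specialization, so the multiplications inside the `Float64` version are known to be plain float multiplications.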

The other thing is that Julia is a language where generic functions are a low-level thing all over the standard library whereas Common Lisp has a mix of a few generic functions (er, documentation is one; there are more in cltl2), a few “pre-clos” generic functions like mathematical functions, sequence functions and to some extent some array functions, and a whole lot of non-generic functions.

iib 5 years ago | | |

Wikipedia has a nice table [1] on the Multiple Dispatch page that describes one study's findings about how multiple dispatch is used in practice in the languages that support it.

Although CLOS and others do support it, Julia seems to take the cake by most metrics, highlighting that it is a core paradigm of the language, more so than in the others.

teleforce 5 years ago | | |

Even better, check out the Stanza language for a modern version and interpretation of Lisp, Scheme and Dylan. It supports multi-methods/multiple dispatch, hybrid dynamic and static typing, and high- and low-level programming, to name a few productive features.

[1]http://lbstanza.org/

MisterBiggs 5 years ago |

I've been running the 1.6 release candidates, and the compilation speed improvements have been massive. There have been plenty of instances in the past where I've tried to 'quickly' show off some Julia code, and I end up waiting ~45 seconds for a plot to show or a minute for a Pluto notebook to run, and that's not to mention waiting for my imports to finish. It's still slower than Matlab for the first run, but it's at least in the same ballpark now.

peatmoss 5 years ago | |

In terms of “don’t make me think about why Julia is fast but feels slow for casual use” this release is going to be a game changer.

I just did a “using Plots” in 1.6.0, and it was fast enough to not care about the delta between Plots and, say, R loading ggplot.

Huge kudos to the Julia team.

sieste 5 years ago | | |

I agree, this is a game changer. Previously time to first plot (TTFP) was >1 minute for me, which made julia completely unusable for my day-to-day exploratory data analysis, visualisation, quick random number experiments etc. Now TTFP is less than 10 seconds. I'm now ready (and excited) to jump ship from R and python!

Sukera 5 years ago | |

What kind of speed do you see now?

MisterBiggs 5 years ago | | |

No idea if this is really a fair comparison but just to get a brief idea of current speeds:

   julia> @time begin
          using Plots
          plot([sin, cos])
          end
    11.267558 seconds (17.98 M allocations: 1.114 GiB, 4.83% gc time)

Versus Matlab which probably takes about 15 seconds just to open the editor but plotting is very fast.

   >> tic
   fplot( @(x) [sin(x) cos(x)])
   toc
   Elapsed time is 0.374394 seconds.

Julia is just about as fast as Matlab after the first run for plotting.

leephillips 5 years ago | | |

I’ve also been running the release candidates, and I get something like 6 seconds to first plot on my 2013 laptop, including the time for `using Plots` and the time to actually draw the first plot. A huge improvement; kudos to the developers.

snicker7 5 years ago |

On the package ecosystem side, 1.6 is required for JET.jl [0]. Despite being a dynamic language, the Julia compiler does a lot of static analysis (or "abstract interpretation" in Julia lingo). JET.jl exposes some of this to the user, opening a path for additional static analysis tools (or maybe even custom compilers).

[0]: https://github.com/aviatesk/JET.jl

akdor1154 5 years ago | |

Good gracious, thanks for this. If JET goes anywhere, then that+other goodies in 1.6 mean I will likely switch back from Python+mypy.

celrod 5 years ago | |

> or maybe even custom compilers

Like for autodiff or GPUs.

cbkeller 5 years ago |

See also Lyndon’s blog post [1] about what all has changed since 1.0, for anyone who’s been away for a while.

[1] https://www.oxinabox.net/2021/02/13/Julia-1.6-what-has-chang...

wiz21c 5 years ago |

Whatever improves loading times is more than welcome. It's not really acceptable to have to wait just because you import some libraries. I understand Julia does a lot of things under the hood and that there's a price to pay for that, but as a Python user I find it a bit inconvenient.

But I'll sure give it a try because Julia hits a sweet spot between expressiveness and speed (at least for the kind of stuff I do : matrix, algorithms, graphs computations).

odipar 5 years ago |

I like Julia (mostly because of multiple dispatch). The only thing that's lacking is an industry strength Garbage Collector, something that can be found in the JVM.

I know that you shouldn't produce garbage, but I happen to like immutable data structures and those work better with optimised GCs.

eigenspace 5 years ago | |

Julia's garbage collector is quite good.

> I know that you shouldn't produce garbage, but I happen to like immutable data structures and those work better with optimised GCs.

If you use immutable data-structures in julia, you're rather unlikely to end up with any heap allocations at all. Unlike Java, Julia is very capable of stack allocating user defined types.

dan-robertson 5 years ago | | |

I think that’s true for small structs made of floats but not true for something like an immutable lisp-style linked list.

superdimwit 5 years ago | |

A low-latency GC would also be great. But again, the JVM only has that due to many millions of dollars spent over decades.

newswasboring 5 years ago | |

I didn't even know julia GC had issues. Care to elaborate?

StefanKarpinski 5 years ago | | |

It doesn’t, it just doesn’t have a $100B GC like Java does. Rather than spending that kind of money trying to compensate for a language design that generates massive amounts of garbage (ie Java), Julia takes the approach of making it easier to avoid generating garbage in the first place, eg by using immutable structures that can be stack allocated and having nice APIs for modifying pre-allocated data structures in place.
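
A small sketch of the two habits described above. All names here (`Point`, `add`, `sum_points`, `scale!`) are made up for illustration. An immutable struct of plain floats is a "bits type" that the compiler can keep on the stack or in registers, and a `!`-style function writes into a buffer the caller already allocated.

```julia
# Immutable by default; holds only plain floats.
struct Point
    x::Float64
    y::Float64
end

add(p::Point, q::Point) = Point(p.x + q.x, p.y + q.y)

# Creates a fresh Point on every iteration, but none of them needs to live on the heap.
function sum_points(n)
    acc = Point(0.0, 0.0)
    for i in 1:n
        acc = add(acc, Point(i, 2i))
    end
    return acc
end

# In-place style: write the result into a pre-allocated output vector.
function scale!(out::Vector{Float64}, v::Vector{Float64}, s::Float64)
    for i in eachindex(out, v)
        out[i] = s * v[i]
    end
    return out
end

total = sum_points(1000)
v = collect(1.0:4.0)
buf = similar(v)      # allocated once, reusable across many calls
scale!(buf, v, 2.0)
```

Running `@allocated sum_points(1000)` after a warm-up call is one way to check on your own machine whether the loop allocates.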

adgjlsfhk1 5 years ago | | |

The biggest struggle Julia's GC has is that in multi-threaded workloads, it sometimes isn't aggressive enough to reclaim memory leading to OOM.

noisy_boy 5 years ago |

How easy is it to produce a compiled executable in 1.6? I took a cursory look at the docs but couldn't spot the steps for doing so.

dklend122 5 years ago | |

That's coming. Pieces are there but still need polish and integration. Fib was around 44kb with no runtime required.

Check out staticcompiler.jl

superdimwit 5 years ago | | |

In your experience, what are the current limitations?

ced 5 years ago | |

We did it for production code installed at client sites, and it has been very easy for us. YMMV

triztian 5 years ago | |

I’ve also looked for this, does it mean that I have to install julia on the target machine and it’ll recompile when running?

Or are there steps to produce a binary (much like Go or C or Rust)??

cbkeller 5 years ago | | |

Currently you can make a relocatable “bundle” / “app” with PackageCompiler.jl, but the bundle itself includes a Julia runtime.

Making a nice small static binary is technically possible using an approach similar to what GPUCompiler.jl does, but the CPU equivalent of that isn’t quite ready for primetime.

Sukera 5 years ago | | |

You probably want to check out PackageCompiler.jl (https://julialang.github.io/PackageCompiler.jl/dev/)

systems 5 years ago |

I know it's minor, but I still hope they will fix scoping

not that my suggestion is good, but what they have now is bad

https://github.com/JuliaLang/julia/issues/37187

StefanKarpinski 5 years ago | |

Has been fixed since 1.5.

systems 5 years ago | | |

no it has not, they now have different rules for repl, which is part of scope awkwardness

patagurbon 5 years ago | | |

Should that issue be closed then?

3JPLW 5 years ago |

The feature I'm most excited about is the parallel — and automatic — precompilation. Combined with the iterative latency improvements, Julia 1.6 has far fewer coffee breaks.

shmeano 5 years ago | |

or sword fights https://xkcd.com/303/

yesenadam 5 years ago | | |

Ohh, is that what the programmers were doing all through Halt and Catch Fire? Waiting for compilation? I couldn't understand how they got away with acting like naughty 5 year olds, throwing things at each other constantly.

pjmlp 5 years ago |

Love the improvements, all those little details that improve the overall usability.

xiphias2 5 years ago |

Cool, I was thinking of downloading the RC, the demo was so impressive.

Will there be an M1 Mac version for 1.7?

thetwentyone 5 years ago | |

I think so - Julia master branch (1.7 precursor) works on M1, but not all the dependencies that some packages require have been built for M1. Though, I understand that the wonderful packaging system and the folks who work on it are working on it.

> `git clone https://github.com/JuliaLang/julia` and `make` should be enough at this point.

https://github.com/JuliaLang/julia/issues/36617#issuecomment...

staticfloat 5 years ago | | |

Yeah, we've managed to get Julia itself running pretty well on the M1, there are still a few outstanding issues such as backtraces not being as high-quality as on other platforms. You can see the overall tracking issue [0] for a more granular status on the platform support.

For the package ecosystem as a whole, we will be slowly increasing the number of third-party packages that are built for aarch64-darwin, but this is a major undertaking, so I don't expect it to be truly "finished" for 3-6 months. This is due to both technical issues (packages may not build cleanly on aarch64-darwin and may need some patching/updating especially since some of our compilers like gfortran are prerelease testing builds, building for aarch64-darwin means that the packages must be marked as compatible with Julia 1.6+ only--due to a limitation in Julia 1.5-, etc...) as well as practical (Our packaging team is primarily volunteers and they only have so much bandwidth to help fix compilation issues).

[0] https://github.com/JuliaLang/julia/issues/36617

xiphias2 5 years ago | | |

Cool, I'll start with 1.6, but it looks interesting of course :)

fermienrico 5 years ago |

Are the performance claims of Julia greatly exaggerated?

Julia loses almost consistently to Go, Crystal, Nim, Rust, Kotlin, Python (PyPy, Numpy): https://github.com/kostya/benchmarks

Is this because of bad typing or they didn't use Julia properly in idiomatic manner?

JulianMorrison 5 years ago |

BTW, broken link on the documentation page, "The documentation is also available in PDF format: julia-1.6.0.pdf." No it isn't.

mbauman 5 years ago | |

Thanks for the report:

https://github.com/JuliaLang/julia/issues/40190

Edit: it's now building:

https://github.com/JuliaLang/docs.julialang.org/runs/2196972...

JulianMorrison 5 years ago | | |

Much appreciated.

f6v 5 years ago |

Is there a per-project way to manage dependencies yet? I find global package installation to be the biggest weakness of all the R projects out there. Anaconda can help, but it’s not widely used for R projects. And Docker... well, don’t get me started.

adgjlsfhk1 5 years ago | |

Yeah. Julia's had that since (at least) 1.0. Environments are built-in, and you specify project dependencies in a Projects.toml file https://pkgdocs.julialang.org/v1/toml-files/.

staticfloat 5 years ago | | |

Small nitpick: it's Project.toml (or JuliaProject.toml, to avoid name clashes), not Projects.toml

UncleOxidant 5 years ago | | |

Or you can activate a local project in a directory, add packages and the Project.toml gets created for you.
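
A minimal sketch of that workflow from a script rather than the `]` REPL mode. The temporary directory stands in for a real project folder, and the `Pkg.add` line is commented out because it needs network access.

```julia
using Pkg

# Stand-in for your project directory (an assumption for this sketch).
project_dir = mktempdir()

# Equivalent to `] activate <dir>` in the REPL.
Pkg.activate(project_dir)

# Pkg.add("Plots")   # would download the package and record it in Project.toml

# Path of the project file that belongs to the active environment.
active = Base.active_project()
```

Packages added while this environment is active are recorded in that directory's Project.toml rather than in the global environment.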

oxinabox 5 years ago | | |

Since 0.7 (which was 1.0 with deprecations). In julia 0.6 and before it was exactly as bad as described (though there were things like Playground.jl to kind of work around it).

krastanov 5 years ago | |

I might be misunderstanding your question, but this post is about Julia, not R. Julia has a pretty great per-project dependency management.

00117 5 years ago | | |

Julia is a competitor of R, hence the comparison.

eigenspace 5 years ago | |

Yes, absolutely. Julia has very strong per-project dependency tracking and reproducibility.

psychometry 5 years ago | |

renv is how R projects do per-package dependency management. Before renv there was packrat. This has been a solved problem for years now...

f6v 5 years ago | | |

Doesn’t mean it’s adopted though.

siproprio 5 years ago |

On 1.6, I tried "] add Plots" and julia got stuck.

StefanKarpinski 5 years ago | |

Please file an issue describing the situation: https://github.com/JuliaLang/julia/issues/new

siproprio 5 years ago | | |

After the issue, I nuked the .julia folder, and now it is taking too long to clone the "JuliaRegistries/General.git" repo.

By the download speed, it might take a few hours before I can plot something.

It also seems that just doing "git clone JuliaRegistries/General.git" is much faster than doing "] add Plots"

dagw 5 years ago | | |

A quick bit of googling shows that people have been complaining about and reporting this for years.

ng55QPSK 5 years ago |

maybe i misread this, but milestone "1.6 blockers" still has 3 open with "1.6 now considered feature-complete. This milestone tracks release-blocking issues." - so how can 1.6 be ready?

kristofferc 5 years ago | |

It is simple. Those issues shouldn't have had the milestone.

ng55QPSK 5 years ago | | |

I see. But you should work on your release process if things like this happen.

   julia> using GalaxyBrain, BenchmarkTools

   julia> bench = bf"""
          >++[<+++++++++++++>-]<[[>+>+<<-]>[<+>-]++++++++
          [>++++++++<-]>.[-]<<>++++++++++[>++++++++++[>++
          ++++++++[>++++++++++[>++++++++++[>++++++++++[>+
          +++++++++[-]<-]<-]<-]<-]<-]<-]<-]++++++++++."""

   julia> @benchmark $(bench)(; output=devnull, memory_size=100)
   BenchmarkTools.Trial:
     memory estimate:  352 bytes
     allocs estimate:  3
     --------------
     minimum time:     96.706 ms (0.00% GC)
     median time:      97.633 ms (0.00% GC)
     mean time:        98.347 ms (0.00% GC)
     maximum time:     102.814 ms (0.00% GC)
     --------------
     samples:          51
     evals/sample:     1

   julia> mandel = bf"(not printing for brevity's sake)"

   julia> @benchmark $(mandel)(; output=devnull, memory_size=500)
   BenchmarkTools.Trial:
     memory estimate:  784 bytes
     allocs estimate:  3
     --------------
     minimum time:     1.006 s (0.00% GC)
     median time:      1.009 s (0.00% GC)
     mean time:        1.011 s (0.00% GC)
     maximum time:     1.022 s (0.00% GC)
     --------------
     samples:          5
     evals/sample:     1

   julia> versioninfo()
   Julia Version 1.5.1
   Commit 697e782ab8 (2020-08-25 20:08 UTC)
   Platform Info:
     OS: Linux (x86_64-pc-linux-gnu)
     CPU: Intel(R) Core(TM) i7-8700K CPU @ 3.70GHz
     WORD_SIZE: 64
     LIBM: libopenlibm
     LLVM: libLLVM-9.0.1 (ORCJIT, skylake)