The Ruby+OMR JIT

109 points by x3qt 9 years ago | 67 comments

I expected CPython to have something like this, especially in 3.x but nothing.

If Ruby can get faster than Python sooner, I'd switch focus to Ruby (and somewhat make Go less of a priority for us).

kweinber 9 years ago | |

Don't count on Ruby getting 10x faster anytime soon. The meta-object model and method-missing reliance of many of the libraries make performance optimizations of JITs a very limited enterprise.

chrisseaton 9 years ago | | |

Optimising method-missing and the rest of the meta-object model in a JIT is now a solved problem - I solved it for my PhD http://chrisseaton.com/phd/

It is actually possible to completely remove the overhead of method-missing.

BenoitP 9 years ago | | |

I hear the Truffle+Graal combination is getting very good results with Ruby. Here is a video about it (cued to the method-missing example): [1]

They're hitting 35x in some benchmarks [2]. Most of the benefit is in workloads where Ruby has to continuously talk to C; the result-passing part gets compiled away.

It is available in Java 9, if I'm not mistaken. LLVM IR bytecode ingestion is in preparation too [3].

[1] https://youtu.be/b1NTaVQPt1E?t=48m17s
[2] http://jruby.org/bench9000/
[3] https://github.com/graalvm/sulong

vidarh 9 years ago | | |

It's true that many libraries would perform poorly because of a reliance on method_missing. But what can be made quite fast is code that either uses define_method instead, or uses define_method on the first failed call.

In fact, a tracing JIT could even make the method_missing case fast by automatically specialising and optimising method_missing, coupled with a vtable/dispatch table.

My forever-in-progress ahead-of-time Ruby compiler uses vtable based dispatch for every method name that's seen in the program at least once, and that's usually most of them (unless people e.g. construct method names dynamically at runtime).

To handle method_missing, it creates thunks for each method name and fills the missing vtable slots with those, which is similar to what I suggest above - the next step of dynamically optimising method_missing for called symbols and replacing the vtable thunks would be a relatively minor step in a JIT.
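As a rough illustration of the thunk idea described above, here is a toy model in plain Ruby. It is only a sketch and not the compiler's actual code: the names SLOTS, build_vtable and dispatch are hypothetical, and lambdas stand in for compiled method bodies. Every known method name gets a fixed slot, and slots with no implementation are filled with a thunk that forwards to method_missing with the method name baked in.

```ruby
# Toy model only; all names here are hypothetical and chosen for illustration.
# Every method name seen in the program gets a fixed slot number.
SLOTS = { greet: 0, shout: 1 }.freeze

# Build a per-class vtable: real implementations where they exist,
# method_missing thunks in every other slot.
def build_vtable(impls)
  table = Array.new(SLOTS.size)
  SLOTS.each do |name, slot|
    # The thunk remembers which name was called and forwards to method_missing.
    thunk = ->(recv, *args) { impls.fetch(:method_missing).call(recv, name, *args) }
    table[slot] = impls.fetch(name, thunk)
  end
  table
end

# Dispatch is an indexed lookup plus a call; no search by name at call time.
def dispatch(vtable, name, recv, *args)
  vtable[SLOTS.fetch(name)].call(recv, *args)
end

impls = {
  greet: ->(recv) { "hello from #{recv}" },
  method_missing: ->(recv, name, *args) { "#{recv} has no #{name}(#{args.join(', ')})" }
}
vtable = build_vtable(impls)

greeting = dispatch(vtable, :greet, "obj")       # real implementation in slot 0
fallback = dispatch(vtable, :shout, "obj", 1, 2) # thunk in slot 1 forwards to method_missing
```

In this model, a JIT that wanted to specialise a frequently hit thunk would simply overwrite that slot with optimised code, which is why the step is described as a relatively minor one.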

I don't think anyone expects Ruby to get dramatically faster in the very short term, but there's lots of opportunity to make most Ruby code much faster in the medium term.

I think we'll also start seeing implementations that use techniques like JIT compilation influence how people write Ruby, because a lot of things won't matter much for MRI performance but will be a huge deal for implementations that use compilation techniques. So it's possible to get a lot more "compiler-friendly" Ruby while still writing clean Ruby that runs well on other implementations.

e.g. the above "define_method on method_missing" basically boils down to (pseudo code):

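    def method_missing(sym, *args)
        # Raise the usual NoMethodError if sym doesn't meet the right criteria
        # (handles? is a placeholder for whatever check applies)
        return super unless handles?(sym)
        self.class.send(:define_method, sym) do |*call_args|
            # ... whatever ...
        end
        send(sym, *args)
    end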

If it lets an implementation speed up its (ab)use of method_missing enough, you'll see people adopt stuff like that.
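For a concrete version of the "define_method on method_missing" pattern, here is a minimal runnable sketch. The class name LazyAttributes, the "get_" prefix and the missing_calls counter are all assumptions made up for illustration; the only point is that method_missing is hit once per method name, after which calls go straight to a real method.

```ruby
# Hypothetical example class; the name and the "get_" prefix are illustrative only.
class LazyAttributes
  attr_reader :missing_calls

  def initialize(data)
    @data = data
    @missing_calls = 0
  end

  def method_missing(sym, *args)
    name = sym.to_s
    # Defer to the default NoMethodError for names we do not handle.
    return super unless name.start_with?("get_")

    @missing_calls += 1
    key = name.sub("get_", "").to_sym
    # Define a real method on the class so later calls skip method_missing.
    self.class.send(:define_method, sym) { @data[key] }
    send(sym, *args)
  end

  # Keep respond_to? consistent with what method_missing accepts.
  def respond_to_missing?(sym, include_private = false)
    sym.to_s.start_with?("get_") || super
  end
end

obj = LazyAttributes.new({ name: "omr", version: 2 })
first  = obj.get_name  # goes through method_missing, which defines get_name
second = obj.get_name  # dispatches directly to the newly defined method
```

Because the method is defined on the class, other instances also skip method_missing for that name from then on, which is the kind of stable call target a compiling implementation can optimise.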

munificent 9 years ago | | |

Smalltalk has those same features (which is where Ruby got them from). Almost all of the high performance JIT technology we use these days was originally invented by Smalltalk VM hackers. They invented that stuff specifically to make that kind of code go fast.

lmcnish14 9 years ago | | |

They've already stated that it'll be a few years before they get Ruby 3x faster.

rurban 9 years ago | | |

No. Tinyruby/Potion is about 20x faster. Doable, but too complicated for the current maintainers

sdegutis 9 years ago | | |

And when you take those two things away, you pretty much just have Java. Which, while it can be optimized, is also Java.

pjmlp 9 years ago | |

It already is, kind of, it is called Crystal, but yeah it isn't the same thing.

RX14 9 years ago | | |

If you don't mind starting from scratch and using a statically typed language, Crystal is amazing. However, you will not get very far porting any large Ruby project to Crystal without a rewrite from scratch.

orf 9 years ago | |

What's wrong with PyPy?

progman 9 years ago | | |

Developers who like Python syntax and want C performance should seriously consider Nim.

http://nim-lang.org

my123 9 years ago | | |

Good Python 3.x support...

brianwawok 9 years ago | | |

It's like the bad parts of Java mixed with Python to get a little more perf

rweichler 9 years ago | |

Or just switch to LuaJIT.

rubyfan 9 years ago |

Anyone have any performance comparisons against other Ruby implementations? I assume performance is the main reason someone would adopt this right?

magaudet 9 years ago |

Author here: Feel free to AMA!

magaudet 9 years ago | |

Someone out of band poked me to give an idea of what I see as the roadmap. I'm currently working on getting the JIT working with Ruby trunk (https://github.com/rubyomr-preview/ruby/tree/ruby_2_4_omr_pr...).

Once that's stable, I'd like to focus on trying to get some of this code integrated into MRI upstream, perhaps as an experimental branch for the 2.5 development cycle, or as something that can be compiled in optionally.

In parallel, I'd like to work on improving performance. We've not put a lot of effort into performance, and have instead focused on compatibility and currency, so that we have a good base on which to grow performance.

noahdesu 9 years ago | |

Slightly off topic since the post is specific to Ruby, but are you able to compare OMR to Truffle and Graal?

magaudet 9 years ago | | |

Not as well as I should be able to, for sure.

In my mind, I look at Truffle and Graal as a potential way forward to build new high performance JVM languages.

OMR I see as a way to build language runtimes in C/C++, and it has a production pedigree.

rubyfan 9 years ago | |

Any performance comparisons to vanilla MRI, JRuby or JRuby Truffle+Graal?

appleflaxen 9 years ago |

Tried looking on GitHub, the project-specific pages, and Wikipedia and still have no clue:

What does OMR stand for?

<something><something>runtime?

magaudet 9 years ago | |

Officially, OMR is a meaningless title, like LLVM. Similar to LLVM, it once stood for something, but then we realized that it didn't actually match the project's 'charter' quite as well as we had hoped... but had grown fond of the name (also... finding new names is super hard).

Original definition was 'Open Managed Runtimes', but we can do more than just managed runtimes with OMR technology, and so that seemed to sell it short.

coldnebo 9 years ago | |

Doesn't seem to stand for anything (at least it's not defined in the project charter).

If I had to guess at it:

Open Meta-Runtime

But I could see how that might be misinterpreted compared to their goals. Not all cap names are acronyms?

https://projects.eclipse.org/proposals/omr

forkandgrok 9 years ago | | |

It stands for Open Managed Runtime.

poorman 9 years ago |

"Right now, it only works for Ruby 2.1.5; however, I'm slowly plugging away at moving forward towards Ruby 2.4."

The irony.

But in all seriousness, this is awesome.

magaudet 9 years ago | |

Whoops! That's actually an editing mistake: The Ruby+OMR preview is actually for Ruby 2.2 right now.

You can follow the work in progress on trunk (what will become Ruby 2.4 in December) here: https://github.com/rubyomr-preview/ruby/tree/ruby_2_4_omr_pr...