Upgrading GitHub from Rails 3.2 to 5.2 (githubengineering.com)

510 points by masnick 7 years ago | 263 comments

dasil003 7 years ago |

No mention of the rails:update rake task, which is a very valuable tool to get your boilerplate updated. I'm guessing at GitHub there is so much customization over the years that they wouldn't get much out of it, but it's still a valuable exercise to run through, and it's worthwhile to keep your boilerplate aligned as much as possible since it makes gems, documentation, and everything else more likely to align with shared community experience.

Also, I want to add a big proviso to the lesson "Upstream your tooling instead of rolling your own". Historically in ruby, and now even more so in node, the ease of pushing new packages and the trendiness of the language at times has led to a lot of churn and abandoning of packages. The trick is to include stable dependencies, and that requires quite a bit of experience and tea-leaf reading to do right. Oftentimes maintaining your own stripped down internal lib can be a performance and maintenance cost win over including a larger batteries-included lib that ends up being poorly supported over time. For example, a lot of people got burned by using the state_machine gem that at one time was very hot and actively maintained but went on to get left in an aggressive limbo (https://github.com/pluginaweek/state_machine).

tcopeland 7 years ago | |

I was always disappointed by rails:update - look at all these crazy changes, I can't do all this stuff! - until I figured out I was using it the wrong way. A good way to use it is to run it in a different branch and then scroll through the diff, picking out items that you want to update. This lets you pick and choose and do even that incrementally, so you can do some easy ones first (like removing unnecessary MIME type registrations) and then move on to more complicated ones.

Longer writeup is at https://thomasleecopeland.com/2015/08/06/running-rails-updat... - it's 3+ years old now, but, hey.

djur 7 years ago | | |

I find `git add -p` useful for that kind of work.

caseyf 7 years ago | |

> Also, I want to add a big proviso to the lesson "Upstream your tooling instead of rolling your own".

I feel like this bit needed more of an explanation about how this applied to GitHub.

If I were to write a post about working in a 10 year old Ruby codebase I'd definitely include "Kill your dependencies" as a bullet point.

tcopeland 7 years ago | | |

> I'd definitely include "Kill your dependencies" as a bullet point

Or at least your monkeypatches!

marcus_holmes 7 years ago | | |

Yeah, I don't really understand about this, especially the security aspect of Gems.

Every piece of externally-maintained code is a security risk, surely? You are implicitly trusting the maintainer of that Gem to not hide bad things in their code. And every Gem that they depend on. If the Gems are old and the maintainer is unpaid and doing other stuff, how sure can you be that they're still vetting all contributions for security? Or that they haven't handed over the maintenance to someone you no longer trust? Or that the maintainer hasn't succumbed to economic pressure and included some malicious code in their Gem?

Or do you have to manually review every single line of code in every dependency yourself? That seems like a lot of work... I would definitely prefer to write my own code for a feature than review 1000's of sloc of someone else's code to spot any problems.

I get that the core Rails codebase gets security-reviewed regularly, but does that happen for Gems? And is it methodical and thorough, or is it just "lots of eyeballs"? And if so, is there a threshold of Gem popularity below which there aren't enough eyeballs to spot problems and the Gem should be considered insecure?

And if you do spot a problem, do you report it and hope the maintainer has time to do something about it? Or do you write a PR and submit it, hoping they accept it? Doesn't that then mean you're maintaining someone else's code base? Again, I would massively prefer to write and maintain my own code than maintain someone else's code (or wait for them to fix a problem that they may no longer care about).

How do you build a secure application for something as trusted as Github while gleefully incorporating all this third-party code?

thibaut_barrere 7 years ago | |

I'll add http://railsdiff.org which I find quite useful too for keeping track of framework defaults etc.

timdorr 7 years ago | |

By the way, it's been changed to `bin/rails app:update` since 5.0: https://guides.rubyonrails.org/upgrading_ruby_on_rails.html#...

lunaru 7 years ago |

For my team, this article comes at some interesting timing, since we're bumping into some of the same issues with Rails.

Rails is now a mature framework and part of the problem is its lack of consideration for large existing codebases running in production. While there are nice tools to help migrate (e.g. rails:update) that hit surface issues, the deep problem is that there are a lot of decisions made going from version to version that are obviously unfriendly to established projects. e.g.: https://github.com/rails/rails/issues/27231

Additionally, there are a lot of gems that are losing momentum, which are near-core to Rails. e.g.: https://github.com/thiagopradi/octopus/issues/490. This is a side effect of the above issue, where the alternatives to Rails are taking a lot of the community away to focus on newer/shinier things. Fortunately, we have companies like GitHub and Shopify that are still very much invested in the success of the ecosystem.

All that said, it's still a great framework to go from 0 to production with a new idea or project.

Other ecosystems we're entrenched in (Node for example) have their share of issues as well, but we won't go into those.

chris72205 7 years ago |

Similar post from Shopify about a year ago on their experience upgrading from Rails 4.2 to 5 https://shopifyengineering.myshopify.com/blogs/engineering/u...

tim333 7 years ago | |

And some similar discussion on "Shopify now on Rails 5.0. started 12 years ago on 0.5, the First version released" https://news.ycombinator.com/item?id=13448219

tjpnz 7 years ago |

It's scarily common for organizations to be running ancient versions of Rails in production. At my last gig we spent six months upgrading a Rails 2.3 application to 3.2 and before that I was working with a team that was maintaining an application written in Rails 1.x. Kudos to GitHub for sharing this, I really hope they do future posts going into more detail. In my experience one of the hardest aspects of upgrading Rails is that so much of the really useful information has either fallen out of Google or succumbed to link rot.

stouset 7 years ago | |

This is true of literally everything.

People don't upgrade their dependencies across the board, and it's a massive problem for long-term security and maintainability.

jb3689 7 years ago | | |

My current company doesn't even lock most of their dependencies. At first I thought it was crazy (and it is crazy) but it does mean we address compatibility issues immediately. We are mostly running background jobs though so it's safer than anything customer facing
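To make the trade-off concrete, here is a small sketch using RubyGems' own `Gem::Requirement` and `Gem::Version` classes. The version numbers are made up for illustration; the sketch only shows which releases an unlocked, a pessimistic (`~>`) and an exact constraint would accept.

```ruby
require "rubygems"

# Which candidate releases does each style of version constraint accept?
# The version numbers below are illustrative only.
CANDIDATES = %w[5.2.0 5.2.1 5.3.0 6.0.0].map { |v| Gem::Version.new(v) }

CONSTRAINTS = {
  unlocked:    Gem::Requirement.new(">= 0"),     # no constraint at all
  pessimistic: Gem::Requirement.new("~> 5.2.0"), # patch releases only
  exact:       Gem::Requirement.new("= 5.2.0")   # fully pinned
}.freeze

# Returns the candidate versions (as strings) that satisfy the requirement.
def accepted(requirement, candidates = CANDIDATES)
  candidates.select { |v| requirement.satisfied_by?(v) }.map(&:to_s)
end

CONSTRAINTS.each do |name, requirement|
  puts "#{name}: #{accepted(requirement).join(', ')}"
end
```

An unlocked dependency picks up every new release as soon as it ships, which is where the "address compatibility issues immediately" effect comes from; the pinned forms defer that work to whenever someone bumps the constraint.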

cptskippy 7 years ago | |

It's not just Rails; this is common with many frameworks and even turnkey solutions from vendors. Nothing will ever meet your needs 100% and so you end up with some customization. It really depends on how integral those customizations were and how tightly they are coupled to the product being used.

hartator 7 years ago |

> Upgrade early and upgrade often

Not sure about the upgrade early. It’s a different kind of pain to be one of the first to use a new Rails version vs lagging a couple of months behind.

jbergstroem 7 years ago |

As a comparison, here's gitlab's journey (issue opened March, 2016): https://gitlab.com/gitlab-org/gitlab-ce/issues/14286

Looks like the first scheduled milestone was 9.5 (a year ago) and the current is set for 11.4 (next release).

Twirrim 7 years ago |

> Upgrade early and upgrade often

It seems daft to keep seeing this lesson being learned by tech companies, and keep seeing blog posts where most of the pain would have been handled easily by just making upgrading a key feature.

Instead, tech managers and engineers seem to make the same mistakes over and over again, delaying those upgrades, until suddenly they discover it's a hard task to upgrade. I get delaying to _some_ degree, it's better to let other people figure out those sharp bits on the bleeding edge for you, but you need to set an explicit target for upgrading.

At another large tech company I worked for, it took the security team swinging the sledgehammer to get teams to upgrade from known-vulnerable versions of Ruby on Rails. When they came to do it, they discovered the changes were so extreme that the effort involved in migrating was likely more than the effort involved in a complete re-write (they did at least have pretty comprehensive tests)

cortesoft 7 years ago | |

It is easy to say that in the abstract, but you always have a finite time/resource budget for doing work. Effort spent upgrading is effort not spent doing other important work. Is the other work more important in the long run? The answer is not trivial to answer.

This is why we call it 'tech debt': it is just like any other debt. You take it on because you don't have the current resources to avoid it, and you calculate that it is worth taking it on. But then, you are carrying the interest on it, and if you aren't careful it will grow to be unmanageable, and all your dev effort goes into just paying the interest without paying the principal.

Twirrim 7 years ago | | |

It absolutely matters. Security is a feature. If you see upgrading as merely technical debt, you're never going to give it the appropriate attention.

mikemotherwell 7 years ago | |

Someone I saw on Twitter (I forget who - apologies) suggested DepOpps - short for "Dependency Opps" - as a job role, defined as a person dedicated to keeping an app's dependencies up to date.

I doubt anyone would enjoy that gig, but it would be a very useful person to have in almost any multi-person team.

jimnotgym 7 years ago |

The regular thread of people piling in to criticise dynamic languages. Instead perhaps people could suggest a better language/framework that is more productive than Rails, and has had a long lifespan in a large codebase?

mooreds 7 years ago |

Super cool explanation of some of the real world difficulties of upgrading large rails applications. Really liked the transparency around process and timeline, as well as the lessons learned section.

stevebmark 7 years ago |

There are several "we upgraded Rails, it was huge, risky, and took months to years" blog posts from medium to large companies. I personally take this as a warning against using Rails. Ruby is one of the most dangerous dynamic languages to refactor, I don't see how struggling to do it for over a year is a selling point of the framework. It also feels counter to Rails's mantra of delivering value fast with little effort, until you need to upgrade, then you have months of no business value delivery and need to bring in experts to help.

sigzero 7 years ago |

I had no idea that Github even used Rails. The things I learn.

powersurge360 7 years ago | |

If you didn't know that then let me share with you this interesting story from 2012. I'm going to repeat it from memory so my details may be a little fuzzy but I'll include a link which should tell the story more faithfully.

So back in 2012 rails had a default behavior where you could mass assign values from a POST to a user and there wasn't any scrubbing of that, by default. Someone realized this was a Bad Idea and issued a pull request that would have fixed it. Instead of accepting the PR, DHH (I think it was him) said something along the lines of 'competent programmers would not leave that setting in place' and rejected the PR.

The exploit discoverer thought about this and tried it against github, which was known to run on rails and the code worked! From there he was able to manipulate the permissions on github to get access to the rails repo where he reopened and accepted his own pull request.

He was promptly banned.

https://gist.github.com/peternixey/1978249
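For readers who have not seen the bug class, here is a minimal plain-Ruby sketch of mass assignment. It is not Rails code and not the actual exploit; the `User` class, its attributes and the `PERMITTED` list are all hypothetical. It only illustrates why copying every request parameter onto a record is dangerous, and how an allow-list closes the hole.

```ruby
# Toy model: NOT ActiveRecord, just enough to show the bug class.
class User
  attr_accessor :name, :email, :admin

  # Hypothetical allow-list of attributes a form is permitted to set.
  PERMITTED = %w[name email].freeze

  def initialize
    @admin = false
  end

  # Unsafe: every key in the request becomes an attribute write.
  def assign_all(params)
    params.each { |key, value| public_send("#{key}=", value) }
    self
  end

  # Safer: only keys on the explicit allow-list are written.
  def assign_permitted(params)
    assign_all(params.select { |key, _| PERMITTED.include?(key.to_s) })
  end
end

# A form post carrying one extra field the form never offered.
REQUEST_PARAMS = {
  "name"  => "eve",
  "email" => "eve@example.com",
  "admin" => true
}.freeze

# First line: the attacker-controlled flag is written.
puts User.new.assign_all(REQUEST_PARAMS).admin
# Second line: the flag keeps its default.
puts User.new.assign_permitted(REQUEST_PARAMS).admin
```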

wgerard 7 years ago | | |

Worth mentioning: Wasn't just a random person, it was Egor Homakov who has a history of finding pretty interesting exploits particularly wrt Rails and Github.

dyeje 7 years ago | | |

This was a huge deal at the time, here's one of the HN threads.

https://news.ycombinator.com/item?id=3663197

orf 7 years ago | | |

Where is the actual pull request?

dyeje 7 years ago |

Wow they've been running 3.2 for this long? That's wild considering the talk Eileen gave at RailsConf this year made it sound like a lot of the Rails 6 scalability stuff was based on GitHub's existing work.

rst 7 years ago | |

They've been running 3.2 with local monkeypatches -- which is part of the reason that upgrading was problematic. (Though certainly not all; over that span, there were lots of breaking changes to supported and documented APIs.)
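One way to keep such patches from silently rotting is to tie each one to the exact upstream version it was written against, so that an upgrade fails loudly instead of misbehaving. The sketch below is an assumption about how one might do that, not a description of GitHub's code; `Upstream`, `Renderer` and `ShoutingPatch` are invented names standing in for a framework class and a local patch.

```ruby
# Stand-in for third-party code we do not control (hypothetical).
module Upstream
  VERSION = "3.2.22"

  class Renderer
    def render(text)
      text.strip
    end
  end
end

# Local patch, applied only if upstream is the version it was written for.
module ShoutingPatch
  WRITTEN_FOR = "3.2.22"

  def self.apply!(target, current_version)
    unless current_version == WRITTEN_FOR
      raise "ShoutingPatch targets #{WRITTEN_FOR}, found #{current_version}; re-check it"
    end
    target.prepend(self)
  end

  # `super` calls the original method, so the patch wraps it rather than replacing it.
  def render(text)
    super.upcase
  end
end

ShoutingPatch.apply!(Upstream::Renderer, Upstream::VERSION)
puts Upstream::Renderer.new.render("  hello ")
```

Using `prepend` with `super` keeps the original method reachable, and the version guard turns "the patch no longer matches upstream" into a boot-time error rather than a production surprise.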

exabrial 7 years ago |

The advice at the end sounds exactly like something I'd say to someone going from 1.8 to 11 with Java. Great advice for any platform, very interesting to see the same conclusions from a totally different stack

aantix 7 years ago |

Using the conditional boot loading, aren’t there structural differences in ActiveRecord queries/scopes that would run under 3.2 but not 5.2?

Did GH just rewrite those scopes in their respective models and maintain a ton of if/else blocks for the different versions? And if so, didn’t they run into issues with the code not being DRY, e.g. someone fixes a 3.2 query, but not the corresponding 5.2 version?
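One way to limit that duplication is to hide each version difference behind a single helper, so there is one branch per incompatibility rather than one per call site. This is a generic sketch, not GitHub's implementation: the `RAILS_NEXT` environment variable, the version strings and the `base_relation_call` helper are assumptions made up for illustration, and the "queries" are plain strings rather than real ActiveRecord calls. The difference it stands in for is real, though: `Model.scoped` returned a chainable relation in 3.2, while from Rails 4 on `Model.all` does.

```ruby
require "rubygems"

# Pick the framework version for this boot from the environment.
# RAILS_NEXT is a hypothetical switch; a second CI pipeline would set it.
def framework_version(env = ENV)
  Gem::Version.new(env["RAILS_NEXT"] ? "5.2.0" : "3.2.22")
end

def next_framework?(version)
  version >= Gem::Version.new("4.0")
end

# The only place that knows the two versions disagree.
# Strings stand in for the version-specific query-building code.
def base_relation_call(version)
  if next_framework?(version)
    "Model.all"    # returns a relation on Rails 4 and later
  else
    "Model.scoped" # returns a relation on Rails 3.2
  end
end

version = framework_version
puts "booting on #{version}: #{base_relation_call(version)}"
```

Call sites then use the helper and never mention a version number, so a fix made under one version cannot be forgotten under the other.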

barrkel 7 years ago | |

If you have your test code unified, and have multiple CI pipelines, it should show up immediately on your build servers.

config_yml 7 years ago |

What do they mean by off-hours? I imagine on a global site like github, there are hardly off-hours?

dyeje 7 years ago | |

Just because it's a global site doesn't mean the traffic is distributed uniformly across the day. Certain regions are going to have higher traffic during business hours. I'd guess their off hours are somewhere around 6pm PST when North / South America has stopped working, Europe / Africa is asleep, and India is just waking up.

toasterlovin 7 years ago | |

They probably mean when they weren't tied up shipping features or tracking down bugs.

starefossen 7 years ago |

As a person working for a large software consultancy in Scandinavia I hate to see so many using type safety as an excuse for not writing tests. At least a dynamic language forces you to write tests and frankly it is often easier to write tests in a dynamic language imho.

ksec 7 years ago |

I am very much looking forward to Rails 6.0 and seeing what Github / Shopify will upstream. Actually Instacart has a lot of great gems too which I wish would have been the default solution in Rails.

throwaway427 7 years ago |

Given its maturity and settled place in the programming landscape it's always nice to see that Rails can still evoke irrational disdain in HN comments.

pwelch 7 years ago | |

100%

gameswithgo 7 years ago | |

Why is it irrational to hate languages that are orders of magnitude slower than necessary?

goatlover 7 years ago | | |

It's irrational to hate without mentioning tradeoffs. Sure, if performance is your only metric, then Ruby is a bad choice. But that's rarely the case, particularly with the web.

rpeden 7 years ago | | |

That's a reasonable question to ask.

I think in general, there are lots of reasons to like a language outside of its runtime performance.

I love working with Go and Rust due to their performance. And I work every day in C#, which ends up nice and quick, too.

But I still love Ruby due to its expressiveness, and the way it works just seems to align with the way I think. But that's probably because I used Smalltalk in the past and I like the bits of it that Ruby borrowed. :)

To answer the original question, though: I'd say it's irrational to hate languages that are slower than necessary because it's irrational to hate a programming language at all. No matter what language it is, it's just a bunch of words on a screen. Use the ones you like and don't waste any brain cycles thinking about the ones you don't.

Unless you're locked in a cube farm and forced to write Cobol at gunpoint all day. Hate might be rational then.

toasterlovin 7 years ago | | |

The thing you performance zealots never seem to realize is that the speed of your language basically doesn't matter for web apps, because there is usually at least 200ms just in transit time to and from the server. An extra 30-50ms spent rendering a result simply doesn't move the needle.

inerte 7 years ago | | |

Because a rational person knows that there are trade-offs in programming languages.

Is it rational to love the fastest languages? Do you rank your programming language love by an arbitrary speed index?

jb3689 7 years ago | | |

Something-something IO something-something

dwb 7 years ago |

Upgrading early(ish) and often, the very obvious preventative measure against terrible and failure-prone rewrite or upgrade projects, is one of the first things that falls by the wayside in the mostly short-termist logic that seems to dominate modern capitalism. It's absolutely infuriating.

ryenus 7 years ago |

Does anyone know which ruby runtime GitHub uses? Ruby MRI or JRuby etc.?

conroy 7 years ago |

> The upgrade started out as kind of a hobby; engineers would work on it when they had free time. There was no dedicated team.

I’m not sure why this still surprises me. For a company the size of Github, there should most certainly be a team responsible for these types of upgrades.

nautilus12 7 years ago |

Why in the world is github still on rails?

stephenhuey 7 years ago | |

Maybe because their codebase still serves their use cases very well?

And perhaps they have little to gain and possibly much to lose if they ditch it?

You didn't say much in your question, so I don't know if you feel they ought to rewrite with a popular SPA framework or use something like Elixir Phoenix, but if their Rails-based solution handily serves 30 million users, why do you feel so strongly they should move to something else?

mrdoops 7 years ago | | |

Nothing wrong with Rails, especially if the team knows it well. Time to develop is the real cost in software most of the time.

If Github wanted to integrate a lot of real-time features, then Elixir + Phoenix can't be beat. Depending on what they replace, a 10x in performance and a fraction of the servers needed is a nice win.

nautilus12 7 years ago | | |

Because Rails is too dynamic for such a mature company. Move to a statically typed language that's not bound by a GIL. Or multiple languages that serve the right purposes for the right job.

notriddle 7 years ago | |

Why not?

k__ 7 years ago |

I had the impression GH switched Ruby for Scala years ago.

marksomnian 7 years ago | |

That was Twitter.

k__ 7 years ago | | |

Oh, lol.

Thanks :)

gameswithgo 7 years ago |

How much money in server costs and how much electricity could be saved if Github didn't use an interpreted language, but something like Go, C#, F#, Java etc?

jb3689 7 years ago | |

Github would not have been the same in any of those. They really took to some of the Rails concepts - a lot better than most Rails companies - and it shows in their product (routing, object structures, etc)

gameswithgo 7 years ago | | |

So your claim is you could not create the same user experience in any language that is jitted or compiled?

I can't really take that seriously.

erokar 7 years ago | |

If they'd used Java they would still be working on the prototype.

gameswithgo 7 years ago | | |

I've seen quite a few attempts to measure productivity differences between different languages and there is not a consistent win being shown by dynamic languages in general. Perhaps ruby on rails is especially productive for the web, and maybe especially so when github.com launched, but there are lots of options now with similar productivity and 1 or 2 orders of magnitude better performance.

mr_toad 7 years ago | |

Isn’t Ruby Jitted now?

abhorrence 7 years ago | | |

Ruby 2.6, which will be released in December (there are release candidates out now), contains method based jit infrastructure, but at least as of a few months ago the optimizations were still fairly limited and had not yet overcome the overhead of jitting.

> But after you've finished getting through the stuff that all languages share (hardware sucks, dependencies suck, etc), a dependently typed language will be a matter of fixing compiler errors, not watching percentages of 500s in prod and crossing your fingers.

> To model the constraints you create in a dependently typed language, you have to create a set of tests and checks in your dynamically typed language which are basically the equivalent of a full dependent-type-system.