Surprising Scalability of Multitenancy (brooker.co.za)

75 points by federicoponzi 3 years ago | 39 comments

Animats 3 years ago |

I'd seen a more useful paper on this subject, on how to organize your game servers for a big MMO. The most economical strategy was to own your servers for the base load and go out to AWS for peaks. Running 24/7 compute-bound work on AWS is at least 2x as expensive as owning your own co-located servers.

ec109685 3 years ago | |

You can buy reserved instances that are about half the price of on-demand, so it really depends on how long the peaks are.

ddorian43 3 years ago | | |

Dedicated servers are 1/4 or less of the price of on-demand (don't forget the bandwidth!!!).
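
The break-even this sub-thread is circling can be sketched with quick arithmetic. The ratios below are only the rough figures quoted in the comments above (reserved at about half of on-demand, dedicated at about a quarter), not real price lists. The function names are made up for illustration, and the sketch assumes reserved and dedicated capacity is paid for around the clock while on-demand is paid only for the hours the peak lasts.

```python
# Rough cost comparison for covering a daily peak, normalised so that
# one on-demand server-hour costs 1.0. All ratios are assumptions
# taken from the comments above, not actual AWS or hosting prices.

ON_DEMAND = 1.0    # paid only for the hours the peak actually lasts
RESERVED = 0.5     # "about half the price", assumed billed 24 hours a day
DEDICATED = 0.25   # "1/4 or less", assumed billed 24 hours a day


def cost_per_day(hourly_rate, hours_billed):
    """Daily cost of one server at the given rate and billed hours."""
    return hourly_rate * hours_billed


def cheapest_for_peak(peak_hours_per_day):
    """Return (name of cheapest option, dict of daily costs per option)."""
    options = {
        "on-demand": cost_per_day(ON_DEMAND, peak_hours_per_day),
        "reserved": cost_per_day(RESERVED, 24),
        "dedicated": cost_per_day(DEDICATED, 24),
    }
    return min(options, key=options.get), options


def break_even_hours(always_on_rate):
    """Peak hours per day above which an always-on server beats on-demand."""
    return 24 * always_on_rate / ON_DEMAND


if __name__ == "__main__":
    for hours in (2, 6, 12, 18):
        print(hours, cheapest_for_peak(hours))
    print(break_even_hours(RESERVED), break_even_hours(DEDICATED))
```

Under these assumed ratios, on-demand only wins while the peak stays under 12 hours a day against reserved, or under 6 hours a day against dedicated. Bandwidth is not modelled, which by the comment above would push the numbers further toward dedicated.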

andyp-kw 3 years ago | |

I guess the latency between AWS and your data centre would have a negative impact on game performance.

jitix 3 years ago | | |

I believe the idea is to spin up new servers on AWS and connect players directly to them instead of hopping via their own infra.

That way your profit margins on the AWS servers are lower than on the self-hosted ones, but at least you’re making money.

toast0 3 years ago | | |

Depends on the details, but pick an AWS location near your DC. And/or pick a DC location near AWS.

nvartolomei 3 years ago | |

Mind linking the said paper?

revelio 3 years ago |

The author sounds a bit scared. Maybe the recent wave of "we can save $$$ by leaving AWS" articles has them rattled?

Yes, multi-tenancy and improved hw utilization can save money ... for Amazon. That's of no use if they lack sufficient competition and just capture the savings as profits. Then you're just wasting time on debugging weird contention issues and cloud cost optimization consultants so Bezos can get richer.

The profit margins on AWS are so huge that even though they can bin-pack better, it often doesn't matter: you're still going to save money by going to either a cheaper cloud or your own HW (owned or rented dedicated HW). The savings from multi-tenancy are drowned by the added costs.

One intriguing model that might be worth exploring is micro-clouds. In that model there's a kind of clearing market, and users with strong diurnal cycles and not many batch jobs can re-sell their CPU capacity at night to other users. They just implement some Lambda-ish API and configure the kernels/hypervisors to always prioritize their own jobs over guests. The guests don't care because they're getting the resources cheap. For the company, the additional income offsets the cost of their own machines, and the market takes a cut. The difference vs today's cloud models is that it's more decentralized and the "cloud provider" is really just a matchmaker, so it's easy to set up competitors and margins would be low.
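
A minimal sketch of the "own jobs first, guests get the leftovers" rule described above, for a single machine in such a hypothetical micro-cloud. Everything here is invented for illustration: the function names, the abstract CPU units, the price and the market's cut. A real setup would enforce the priority in the kernel or hypervisor rather than in a loop like this.

```python
# Sketch of owner-first capacity sharing on one machine.
# Capacity and demand are in abstract CPU units per hour.

def split_capacity(capacity, owner_demand, guest_demand):
    """Return (owner_units, guest_units) for one hour.

    The owner's jobs are served first; guests only get what is left.
    """
    owner_units = min(owner_demand, capacity)
    guest_units = min(guest_demand, capacity - owner_units)
    return owner_units, guest_units


def resale_income(capacity, owner_by_hour, guest_by_hour,
                  price_per_unit, market_cut):
    """Owner's income from reselling idle capacity, after the market's cut."""
    sold = sum(
        split_capacity(capacity, owner, guest)[1]
        for owner, guest in zip(owner_by_hour, guest_by_hour)
    )
    return sold * price_per_unit * (1 - market_cut)


if __name__ == "__main__":
    # Invented diurnal pattern: nearly idle at night, busy during the day.
    owner = [2] * 8 + [9] * 12 + [2] * 4
    guests = [6] * 24
    print(resale_income(10, owner, guests, price_per_unit=1.0, market_cut=0.1))
```

With a pattern like this, guests get most of what they ask for at night and are squeezed to the leftover unit during the owner's busy hours, which is the trade the cheap price is supposed to pay for.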

eecc 3 years ago | |

that'd be cool but quite improbable until exploits like RowHammer, Meltdown and Spectre can be reliably ruled out.

ElevenLathe 3 years ago | | |

Even if those were sorted, you probably want to hold out for homomorphic encryption. The threat model of Amazon having all your data is much different from the threat model of anyone willing to bid cheaply enough on a lambda execution having it. OTOH in the latter case, we can probably expect three-letter agencies all over the world to be generously subsidizing our compute (for example, by reselling GovCloud at a loss).

revelio 3 years ago | | |

Those problems affect cloud providers too.

BTW modern CPUs support the creation of RAM-encrypted VMs with remote attestation, so you can lower the trust needed in the targets by a lot. That said there are lots of companies that are known quantities, have verifiable brands and may even be considered more trustworthy than the big clouds in some cases because they're local firms.

ec109685 3 years ago |

It’s ironic that AWS touts the benefit Lambda gets from overcommit, but if you build a Lambda that simply turns around and makes an API call, you are paying full price for the CPU usage, even though it’s idle.

pclmulqdq 3 years ago | |

It doesn't matter if it's more efficient for Amazon (which serverless very much is) if they don't pass on the savings to you. Lambda is priced as a "value add" not as an efficiency improvement.

ec109685 3 years ago | | |

They should discount based on average CPU used.
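
To make the suggestion concrete, here is a rough sketch comparing billing by wall-clock duration with a hypothetical bill scaled by average CPU utilisation. `RATE` is an invented price per second, and neither function reflects how Lambda is actually priced; they only illustrate the difference being discussed.

```python
# Two hypothetical ways to bill a single function invocation.

RATE = 1.0  # invented price per second of invocation time


def bill_wall_clock(duration_s):
    """Charge for the whole invocation, whether the CPU was busy or idle."""
    return duration_s * RATE


def bill_cpu_weighted(duration_s, cpu_busy_s):
    """Charge scaled by average CPU utilisation over the invocation."""
    if duration_s <= 0:
        return 0.0
    utilisation = min(cpu_busy_s / duration_s, 1.0)
    return duration_s * RATE * utilisation


if __name__ == "__main__":
    # A function that runs for 2 s but spends 1.5 s waiting on an API call.
    print(bill_wall_clock(2.0), bill_cpu_weighted(2.0, 0.5))
```

For a function that mostly waits on a downstream call, the CPU-weighted bill is a small fraction of the wall-clock one, which is the gap the comments above are complaining about.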

throwawaylinux 3 years ago |

Who is this surprising to? Timesharing, timeslicing, multiprocess, multitenancy (whatever you call the same underlying concept) was one of the pivotal advances in computer systems. Surely no serious person is surprised it is effective.
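
For anyone who wants the underlying effect spelled out, a toy sketch: tenants whose peaks fall at different times need less total capacity when pooled than when each gets hardware sized for its own peak. The hourly loads and function names are invented for illustration.

```python
# Toy illustration of why sharing hardware helps.
# Each inner list is one tenant's load per time slot (abstract units).

def dedicated_capacity(loads):
    """Capacity needed if every tenant gets hardware sized for its own peak."""
    return sum(max(tenant) for tenant in loads)


def shared_capacity(loads):
    """Capacity needed if all tenants share one pool sized for the combined peak."""
    return max(sum(slot) for slot in zip(*loads))


if __name__ == "__main__":
    tenants = [
        [8, 2, 1, 1],
        [1, 7, 2, 1],
        [1, 1, 9, 2],
    ]
    print(dedicated_capacity(tenants), shared_capacity(tenants))
```

The gap shrinks as tenants' peaks line up in time, so the benefit depends on how uncorrelated the workloads are.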

RcouF1uZ4gsC 3 years ago |

One thing this scalability bets on is that side channel attacks won’t get better.

Spectre and related attacks already reduced CPU performance.

Shared hardware opens the door to side-channel attacks, and hardening against those attacks is going to decrease performance.

jmillikin 3 years ago | |

You'd generally use co-tenancy for workloads that are mutually trusted. Privileged services (authn/authz, machine management, deployable artifact builds) get put onto separate hardware, since their footprint is small enough that the extra 200% cost isn't material.

pclmulqdq 3 years ago | | |

This isn't how things always run in the cloud. I think the conventional wisdom is that the isolation of VMs is good enough unless you are very paranoid. Auth services are regularly run on less than full bare-metal machines.

AWS serverless, by the way, uses VM isolation.