Hetzner EX 4S Unixbench, dd and htparm results(gist.github.com) |
Hetzner EX 4S Unixbench, dd and htparm results(gist.github.com) |
An Amazon m2.4xlarge instance (High-Memory Quadruple Extra Large Reserved Instance) with Heavy Utilization on a 3-year commitment costs $9'660 for the 3y term and 3 * 365 * 24 * $0.454 = $11'931 in usage fees, which means a $21'591 total, $7'197 per year or $600 per month (EUR 458 per month at the current exchange rate).
458 EUR versus 49-60 EUR is 9x more expensive at full-time usage. (instance type and price data extracted from http://aws.amazon.com/ec2/instance-types/ and http://aws.amazon.com/ec2/pricing/ ).
If you were to consider just the Amazon Heavy Utilization term commitment for 3 years, that's still $9'660 / 3 / 12 = $268 (EUR 205) each month.
The EX 4S looks like a very compelling alternative for many use cases but saying that its 9x cheaper is comparing apples to oranges.
You can rent 3 Hetzner servers to deal with 300% load spikes or improve reliability. And you're still 3 times cheaper then EC2.
Or as others have written you can buy 3 boxes at hetzner and scale 300% ;)
Solution?
A long term contract is the perfect comparison because it's a similar service, you know you'll be using the capacity so you pay up front to have it 'reserved' and in exchange receive reduced prices. Normally with EC2 you'd buy 1 instance and then spin up 8 more on demand, with hetzner you just buy 9 machines and pay the same price you'd pay for 1 AWS machine. When your load spikes you've got 8 extra machines to handle the load. Voila, "auto scaling".
I don't know very many businesses that need to scale beyond 8X capacity for an afternoon. If you really think you need the EC2 API to add and remove machines on demand just install UEC.
It may be comparing apples to oranges but I can tell you that when I need fruit if apples are a $1/lb and oranges are $9/lb I buy apples.
Personally, I think the XS29 @ $299 per month would have been a much better comparison, with 15 drives you could push through some serious IO which would make it suitable for running a database on.
Some would say this isn't fair because you're not including elastic scaling which AWS supports, however you can't elastically scale with a 3 year commitment. Further its hard to estimate how much scaling you'll need because that's a highly variable figure.
But you can say that you will need to keep some number of servers up and running 24/7 for your service.
I think it would be a more perfect comparison to compare AWS's on-demand price to Hetzner with the built in higher performance of the 4S considered as a hot standby for scaling. (I don't know how long it takes Hetzner to provision extras, though I hear its relatively fast.) Since servers are cheap (at Hetzner, at least) it seems reasonable to have 1 or 2 extra spun up if you really have spiky traffic.
Since I'm planning to get a cluster of Hetzner boxes to support a Riak distributed cluster, and each Hetzner box is over provisioned for my needs (if I get a 4S I think they will be way over-provisioned, leaving a lot of headroom for this "elastic scaling" issue)... I think I'll compare the price of the same number of each boxes. (The AWS box has less performance, but I can elastically scale if I need to, and I'll just give the cost of those short term scale-ups to Amazon for free in this example.) The results may not be a perfect example for whatever others are considering, but it is a reasonable comparison for my purposes. (my service is a big data service, involving a lot of map reduce, requiring a cluster for operational simplicity and reliability as well as the ability to scale to millions of customers. I don't know if we'll get millions of customers, but one of our closest competitors had millions of customers within a few weeks.)
Amortizing the 4@ m2.4xlarge = 4* $600/month = $2,400 (using the $600 for the reserved instance costs from vladd's example.)
4@ Hetzner 4S = $149 Euro setup fee, amortized over 3 years = 4.14 Euro/month Add the 49.58 Euros in hosting = 53.71 euros x 4 servers = 214.87 euros or $282.50 a month.
So, for my cluster, AWS is 8.5 times as expensive.
I think that $282 per month is a lot easier for a scrappy startup to handle than $2,400 as well. This brings "big data" within the realm of undefended mortals.
Many people use AWS for all the extra services that AWS offers. The value of these services is, of course, subjective, as some startups will need them and others won't. But they all are based on a custom Amazon API which means if you build your architecture across many servers at Amazon you've got a fair amount of lock-in.
For me, I'll take the 8.5X reduction in costs, over lock in anyway. If Hetzner starts performing poorly, I can migrate without too much difficulty, and without having to re-architect my service.
Quite apart from the ability to buy by the hour, the Hetzner offering really isn't suitable for processing any data you care about - it's not using ECC RAM, and the processor used doesn't support it.
In one study (http://www.cs.toronto.edu/%7Ebianca/papers/sigmetrics09.pdf), strong correlations were found between age, usage and error rate. Before using a server without ECC RAM, it's worth doing a quick estimate of the amount of data you intend to move around, and whether it's worth it to you (and, more importantly, your customers) to save a little money going with a non-ECC platform.
http://www.hetzner.de/en/hosting/produkte_rootserver/ex6
a little more expensive (69 EUR) and less RAM (16 GB) but using ECC.
[1] http://blog.cloudharmony.com/2010/05/what-is-ecu-cpu-benchma...
The recent buzz is the first time I see them mentioned on US-centric sites though, if memory serves.
As for your question, it appears that many people find the price/performance ratio of the EX 4S compelling.
That still doesn't mean that it's a fair comparison. EC2 has plenty of other features out-of-the-box like the ability to manage security groups, VPNs, elastic IPs, easy access to other AWS offerings like SQS and S3, the ability to treat images and the machines they're running on as independent, multiple locations, etc.etc.
Yes you can have all these things outside of the cloud, and yes you might not need any or all of them. (EDIT: and yes using their services means you're locked-in to some degree.) Still, I maintain my original point which is that you can't really say that Hetzner is X times cheaper based purely on the hardware specs, disregarding the fact that EC2 is much more than just a bunch of virtual machines.
I have found that OVH has a virtual rack that can accomplish this. After years with Hetzner I think it is time to move on
I was assuming that having the servers in different racks would be an advantage, mainly because the reason I have multiple servers is for data redundancy (I'm using Riak and data is replicated across the cluster.) However, in Riak the servers do talk to each other a fair bit.
The latency and bandwidth between a given server and a web browser out in the world shouldn't vary much from rack to rack, I'm assuming.
I am curious about a couple of specific use cases that I can think of where this might affect me (and, most likely, would be common points of failure for others):
1. data in the MySQL tables (stored in memory) is corrupted. Would mysql crash? Indicate table corruption, and I could just reload it from disk? Write corrupted data to disk, and permanently trash my data collection?
2. large process that's running data analysis (say a big python process with tons of data in RAM). Would one of my variables (say an int with value 4) turn into another number? Would it become unreadable?
I appreciate the effort to explain this. I know, in theory, why ECC RAM is useful, but I have difficulty visualizing real world scenarios.
With MySQL any of those things you can happen. If you're lucky then only the cache is corrupted and then you can just reload from disk. If you're unlucky then the data got corrupted on its way to disk and the wrong data will be written to disk. If you are astronomically unlucky then the in memory machine code of MySQL got changed in such a way that it starts overwriting your entire disk with garbage. You should probably be more afraid of meteorites though. And of bugs in either your own or others' code.
ECC RAM reduces the probability of such a bit flip happening. That doesn't mean that they are eliminated entirely. So you have to do these two things in any case:
1. Bit flips can cause processes to misbehave/crash. So you want to have a way to detect and restart misbehaving/crashed processes.
2. Even with ECC RAM you want to do your own error correction for critical data (say a bank transaction log).
Here is an interesting paper that discusses the prevalence of DRAM errors and the effectiveness of ECC RAM:
DRAM Errors in the Wild: A Large-Scale Field Study -- http://www.cs.toronto.edu/~bianca/papers/sigmetrics09.pdf
It would be interesting if somebody did an experiment where they artificially flipped bits of various software's memory to see what happens. I'd expect that in many cases it doesn't do any harm at all.
Let's use MySQL as an example. A bit flip in the memory which holds the code may cause it to crash. A bit flip in the 'metadata' could cause the table to become corrupted, potentially recoverably. A bit flip in the data itself could turn 'Travis' into 'Trbvis', which might go undetected depending on where it happened and which storage engine you are using.
The use of memory for OS page caching (less so in databases, which often use O_DIRECT and more so in other programs) means that arbitrary corruption could happen to pieces of disk data your program didn't even touch, if you touch data near them.
I've had non-ECC RAM systems destroy database tables leading to data loss.
Systems with ECC are either able to correct the error (and log it), or throw the alarm bells. Even discounting RAM bit flips, simple bad RAM can destroy your data. Having an ECC aware system (including getting Linux to check the EDAC or have the baseband monitor do so) has saved me many times from failing hardware.