AWS announces per-second billing for EC2 instances

AWS announces per-second billing for EC2 instances(techcrunch.com)

305 points by jonny2112 8 years ago | 139 comments

zedpm 8 years ago |

That's sure nice, but I'm waiting for AWS to switch to automatic sustained use discounts [0] like GCP offers.

[0]: https://cloud.google.com/compute/docs/sustained-use-discount...

mbesto 8 years ago | |

They would legitimately lose a lot of money if they did this now (and I would argue, wouldn't make up for it in market share). Most companies I work with who switch to AWS think it's a 1:1 conversion in terms of cost from a data center and pretty much just "leave AWS on" not realizing that they are not only paying for computing cost but the hidden cost of being able to scale up more instances quickly.

user5994461 8 years ago | | |

It's really really difficult to be able to turn on and off instances as needed, like turn a test instance on in the morning when developers come in and turn it off in the evening when they leave.

jaxbot 8 years ago | |

I'm waiting for GCP to support preemptible GPU instances. Would love to be able to spend ~50% less on GPU instances when running a batch job that can be stopped and reloaded.

doh 8 years ago | | |

Not sure that will happen anytime soon. It's the same with Local SSD. The virtualization of this environment is very challenging, maybe beyond a point where it makes sense for them to do so.

_wmd 8 years ago | |

It wouldn't be a switch, it'd be a new feature. GCP also offers the equivalent of explicit instance reservations

zedpm 8 years ago | | |

Good point, poor choice of words on my part.

pmelendez 8 years ago | |

That is cool if you maintain a fixed number of instances but it doesn't seem to be very cost efficient if you are implementing autoscale

manigandham 8 years ago | | |

GCP billing is all calculated on a cpus/ram/hours basis. You're basically buying capacity units (and can even commit long-term for more discount) and can use that capacity in whatever way you want.

Running 2x 4cpu instances for 1 month is the same as 4x 2cpu instances for 1 month, and will come out the same even if you switch in the middle.

ranman 8 years ago |

Link to AWS Blog Post: https://aws.amazon.com/blogs/aws/new-per-second-billing-for-...

deafcalculus 8 years ago |

I really wish AWS would allow users to cap billing. Something that freezes all AWS services if the monthly bill exceeds X would make me a lot more comfortable when experimenting with AWS.

sarabande 8 years ago | |

Agreed. Right now you can set billing alarms, but not actually freeze billing. http://docs.aws.amazon.com/awsaccountbilling/latest/aboutv2/...

extra88 8 years ago | | |

Looking at the CloudWatch console, I see I can also add AutoScaling and EC2 actions that are triggered by an alarm. That still leaves open other risks like a bandwidth bill for S3 hosted content.

angrygoat 8 years ago | |

This is the main reason I keep a $40/month Linode running, when I'd probably save if I used AWS -- most of the time my instance isn't doing a lot. I just don't want the anxiety of a billing surprise due to a mistake, or a DDOS coming my way.

colde 8 years ago | | |

Have you looked at https://amazonlightsail.com/ as a way to make get a billing system similar to Linode?

homero 8 years ago | |

I lost $1200 to an s3 mistake when I was broke years ago, it really sucked.

cgag 8 years ago | | |

I mistakenly got a 1400 dollar bill . I just called and said it was a mistake and I don't want to pay it and they said ok. Probably too late for you but maybe helpful for someone in the future.

matteblack 8 years ago | |

Have you looked into a tool like Gorilla Stack - https://www.gorillastack.com/?

(Note: I have no investment in the company)

Ollibe 8 years ago | | |

Thanks Matteblack (whoever you may be) - my name's Oliver and I got alerted to this mention of GorillaStack.

Without wishing to be too promotional, you can set a trigger to shut off EC2 when a cost threshold is reached. You can currently automate shut off of RDS but not yet from a cost threshold (that will be available shortly).

TLDR: we can do a lot of what you ask but not all of it.

Feel free to reach out if you'd like more info.

nodesocket 8 years ago |

Per second billing is somewhat of a gimmick just so Amazon can say they are more granular than Google Compute. The difference between seconds and a minute of billing is fractions of a cent. Rounding errors.

The exception is Google Compute has a 10 minute minimum, so if you are creating machines and destroying them quickly, per second billing will be noticeable.

grzm 8 years ago | |

I think the useful comparison people are making is the difference between the previous per-hour billing and new per-second billing. Sure, if they can get some mileage comparing per-second to per-minute, great. At the end of the day isn't the increased granularity better?

mfringel 8 years ago | | |

I think "second vs. minute" might be into "distinction without a difference" territory. Sure, per-second will definitionally help at the margins, but I'm dubious about the amount of money it will actually save.

That all being said, it can enable a bunch of interesting things (e.g. more interesting stuff with AWS Lambda), and I look forward to see what per-second billing becomes an enabling technology for.

aidos 8 years ago |

This is one of the better things to happen in ec2 in years for me. We have a bunch of scripts so a spot instance can track when it came online and shut itself down effectively. It took far too much fiddling around to work around aws autoscale and get efficient billing with the per hour model. In the end we came up with a model where we protect the instances for scale in and then at the end of each hour, we have a cron that tries to shut all the worker services down, and if it can't it spins them all up again to run for another hour. If it can, then it shuts the machine down (which we have set on terminate to stop). The whole thing feels like a big kludge and for our workload we still have a load of wasted resources. We end up balancing not bringing up machines too fast during a spike against the long tail of wasted resource afterwards. This change by ec2 is going to make it all much easier.

gumby 8 years ago |

Back to the future: this was how computing worked back in the punch card days. Minicomputers and personal computers were supposed to liberate you from this tyranny: computing so cheap that you could have a whole computer to your self for a while!

mikeash 8 years ago | |

Somehow we've reached a point where a 2GHz, 2GB computer that fits in your pocket is only worth using as a terminal.

reilly3000 8 years ago | | |

Apple is moving in the opposite direction with onboard-only face ID and ARKit.

sinatra 8 years ago | |

The global scale makes even that cheap computing become fairly expensive. So, it's only natural that we'll take punch card type ideas (at local scale) from the past and apply them at our global scale.

segmondy 8 years ago | |

We still have whole computers to ourselves. Those that wish to throw their money away renting can knock themselves out.

Spooky23 8 years ago | |

We have these things. It's even cheaper in most cases. :)

JosephLark 8 years ago |

Likely due to GCP competition. I believe GCP was always per-second? [Edit: Misremember that, they were always per-minute. Lots of good information below directly form the related parties.]

Azure looks to be per-hour [Edit: Wrong again, they are per-minute as well. Oddly enough, I did check their pricing page before, but missed the per-minute paragraph and only saw the hourly pricing] but I'm seeing something about container instances possibly being per-second.

Scaevolus 8 years ago | |

GCP VMs are per-minute, with a minimum of 10 minutes (vs AWS' new minimum of 1 minute). Second resolution is nice, but I doubt it makes much difference in pricing for most workloads. https://cloud.google.com/compute/pricing#billingmodel

Azure's containers don't use a full VM-- they're more like AWS Lambda or other serverless frameworks, so they do per-second billing with no minimums.

Disclaimer: I work at Google on Container Engine.

dsp1234 8 years ago | | |

Azure's EC2 equivalent is Azure Virtual Machines[0], which bills by the minute.

[0] - https://azure.microsoft.com/en-us/services/virtual-machines/

"Keep your budget in check with low-cost, per-minute billing. You only pay for the compute time you use."

dastbe 8 years ago | | |

I would disagree on no minimums and equivalency to lambda et al, as azure container instances charge a create fee (iirc its equivalent to 100 seconds of their minimum configuration) which sits on top of the per-second billing.

doh 8 years ago | | |

We don't mind per-minute billing on GCP, but would love to get the minimum down to 1 minute or even less. We have some tasks that finish under 4 minutes where scaling horizontally instead of vertically makes much more sense to us.

andruby 8 years ago | | |

Hyper.sh runs docker containers, not VM's, and has per-second billing with a minimum of 10s.

I really want to use them to parallelise CI test runs, but haven't gotten round to setting this up yet.

jopsen 8 years ago | | |

The minimum of 1 minute makes a difference... Granted 10min is not that much of a problem :)

lostapathy 8 years ago |

This should enable some entirely new use cases, especially around CI and automation in general.

Per-second billing greatly reduces the overhead to bringing up an instance for a short task then killing it immediately - so I can do that. There's no need to build a buffer layer to add workers to a pool and leave them in the pool, so that you didn't end up paying for 30 hours of instance time to run 30, two-minute tasks within an hour.

movedx 8 years ago | |

For us it will mean we can spin down Bamboo elastic agents much quicker and save money.

YokoZar 8 years ago |

I once considered writing an EC2 autoscaler that knew the exact timestamps of the instances so that it could avoid shutting down VMs that still had 59 minutes of "free" time left because they'd been up across another hour-long threshold. That sort of nonsense logic shouldn't be useful, but Amazon was giving a huge economic incentive for it.

This is certainly a long time coming.

daigoba66 8 years ago | |

If it helps, AWS' default auto scaling algorithm specifically takes into account instances which are nearest to their next billing hour and prioritizes those for termination accordingly to, in theory, save money.

matwood 8 years ago | |

> I once considered writing an EC2 autoscaler that knew the exact timestamps of the instances so that it could avoid shutting down VMs that still had 59 minutes of "free" time left because they'd been up across another hour-long threshold.

Years ago my boss at the time did this (this was when scaling had to mostly be done in code/by hand). I just recently updated all the code as I moved it to using spots. The low price of spots made it less important to shut ones down closer to the hour mark though.

vacri 8 years ago | |

Remember that AWS has been in the game for over a decade. Per-hour billing was amazing when it came in.

Also, is the economic incentive really that huge? Or is it just a nicety?

jdc0589 8 years ago | | |

it's totally dependant on your workload. for some users there will be absolutely no difference, for others it could easily be thousands or tens of thousands of dollars of savings over a year.

macarthy12 8 years ago | |

> that it could avoid shutting down VMs that still had 59 minutes

AWS batch currently does this. I presume that will change now.

djhworld 8 years ago |

This is great news and a long time coming.

I really hope Amazon build something like Azure Container Instances [1], as per second billing would make this sort of thing feasible.

[1] https://azure.microsoft.com/en-us/services/container-instanc...

rsynnott 8 years ago |

Ah, finally. They've ruined my idea for an optimal EMR job runner. Under the old system, if you have a linearly scalable Hadoop job, it's cheaper to, say, use 60 instances to do some work in an hour vs 50 instances to do the work in 70 minutes, assuming you're getting rid of the cluster once you're done. No more!

nogox 8 years ago |

I think the per-second billing is off the point. How does it help, if the EC2 instance takes tens of seconds to launch, and tens of seconds to bootstrap?

To make the most of per-second billing, the compute unit should be deployed within seconds, e.g. immutable. prebaked container. You launch containers on demand, and pay by seconds.

bschwindHN 8 years ago | |

It has a one minute minimum anyway. And does it not help? Let's say a deployment strategy has a temporary increase in instances so it can transition to a new version of the application. If your deployment takes 5 minutes, you're only paying for 5 minutes worth of extra instances whereas the hourly billing would get you for an entire hour. Am I completely misunderstanding something?

riobard 8 years ago | |

You're describing exactly https://hyper.sh.

nogox 8 years ago | | |

Or Azure ACI.

Per-second billing doesn't add much value to EC2.

andrewstuart 8 years ago |

Really welcome, although per millisecond would be better.

It's now possible to boot operating systems in milliseconds and have them carry out a task (for example respond to a web request) and disappear again. Trouble is the clouds (AWS, Google, Azure, Digital Ocean) do not have the ability to support such fast OS boot times. Per second billing is a step in the right direction but needs to go further to millisecond billing, and clouds need to support millisecond boot times.

planteen 8 years ago | |

Just curious here, what OS can do millisecond boot times? How many milliseconds are you talking? And the constant boot time of the OS is so much less than the OS responding to the web request that this is actually worth it?

tylerhou 8 years ago | | |

https://en.wikipedia.org/wiki/Unikernel

vidarh 8 years ago | |

If you're concerned about cost, AWS is almost never the right place to host to begin with.

prohor 8 years ago | | |

Agreed - just check cloud comparison, AWS is rarely at the top: https://www.cloudorado.com/cloud_server_comparison.jsp

andrewstuart 8 years ago | | |

You miss the point.

The lifetime of a web request, for example, can be measured in milliseconds.

It is now possible, technically anyway, for operating systems to boot, service the request and disappear.

There needs to be pricing models that reflect computing models like this.

jawns 8 years ago | |

Sounds like you're describing AWS Lambda / serverless architecture. But maybe I'm not understanding your use case?

andrewstuart 8 years ago | | |

There are a wide range of tiny operating systems that can boot in a matter of milliseconds.

The applications are "whatever you want imagine" but yes one application is building FAAS Function As A Service in which the operating system carries out a single function.

Put anther way, Docker is complex, overweight, and requires re-implementation of much computing infrastructure. You can meet many of the same goals as Docker in much more simple way not by building containers but by building tiny operating systems.

ttobbaybbob 8 years ago |

Interesting that the techcrunch link has thrice as many upvotes as the amazon link

grzm 8 years ago | |

I suspect it's simply a function of which one happened to catch people's eye and where they started their discussion. Multiple submissions on the same topic (from the same or different sources) aren't that uncommon. Once one gets some momentum, it's likely to be reinforced: it'll appear higher on the front page, more people will notice it, more people will comment, more people will notice the comments on that article, which will make them notice that article, et cetera. I don't think you can read much more into it than that. I wouldn't be surprised if a mod comes along and merges the comments from one into the other, if they notice it.

What would be interesting is if they had exactly the same upvotes and comments.

ranman 8 years ago | |

Hackernews black magic :/

SadWebDeveloper 8 years ago |

Serverless advocates/engies are probably the only people celebrating this, everyone else keeps waiting for self renew instance reservation... last time i forgot about them it was too late.

scaryclam 8 years ago | |

The market for this is much broader. We do a bunch of data science so spinning up a heavy machine and only getting billed for the 5 minutes usage is a massive saving for us. I'm quite excited by this news!

SadWebDeveloper 8 years ago | | |

That is basically the serverless main "point of sell" you are just one step away from automation (if you aren't already doing it) and it will be virtually the same as serverless

nunez 8 years ago |

This is great and will save a lot of people a good amount of money.