I got pwned by my cloud costs (troyhunt.com)

1398 points by andimm 4 years ago | 641 comments

buro9 4 years ago |

Don't put Cloudflare in front of a Cloud egress bill. i.e. don't do this: Azure|Amazon > Cloudflare

Always use your own proxy where the egress is well within your free tier, i.e. do this: Azure|Amazon > Hetzner|Linode > Cloudflare

Why?

Because Cloudflare cache is a massively multi-tenant LRU cache and whilst hot files will be cached well (and with Cloudflare Tiered Cache even better - but this itself is a cost) anything else is still going to expose you to some degree of egress cost.
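To make the eviction point concrete, here is a minimal sketch of a shared LRU cache. The `LRUCache` class, the sizes and the traffic pattern are all made up for illustration and say nothing about how Cloudflare's cache is really built. It only shows that a file requested rarely, relative to everyone else's traffic, keeps getting evicted and refetched from origin.

```python
from collections import OrderedDict


class LRUCache:
    """Size-bounded LRU cache: the least recently used objects are evicted first."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.used = 0
        self.items = OrderedDict()  # key -> object size

    def get(self, key, size):
        """Return True on a cache hit, False if the object had to come from origin."""
        if key in self.items:
            self.items.move_to_end(key)  # mark as most recently used
            return True
        # Miss: make room by evicting the oldest entries, then store the object.
        while self.items and self.used + size > self.capacity:
            _, evicted_size = self.items.popitem(last=False)
            self.used -= evicted_size
        if size <= self.capacity:
            self.items[key] = size
            self.used += size
        return False


def origin_fetches(other_requests_between, my_requests=5):
    """Count origin fetches for one file while other tenants share the same cache."""
    cache = LRUCache(capacity=100)
    misses = 0
    counter = 0
    for _ in range(my_requests):
        if not cache.get("mine", size=20):
            misses += 1
        for _ in range(other_requests_between):
            counter += 1
            cache.get(f"other-{counter}", size=10)
    return misses


print(origin_fetches(other_requests_between=2))   # 1 (hot file: only the first request hits origin)
print(origin_fetches(other_requests_between=10))  # 5 (cold file: every request hits origin)
```

With a real CDN the ratios are obviously different, but the shape of the problem is the same: whether your file stays cached depends on other people's traffic, not just yours.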

When I exposed AWS to the web I paid $3k per month to AWS. With Cloudflare in front of AWS I paid $300 per month to AWS. With Linode in front of AWS and behind Cloudflare I paid $20 per month to Linode and about $12 per month to AWS.

A Linode, Hetzner instance... or any other dumb cheap web server that comes with a healthy free tier of bandwidth is all you need to set up a simple nginx reverse proxy and have it cache things to disk https://docs.nginx.com/nginx/admin-guide/content-cache/conte...

sascha_sl 4 years ago | |

Or simply use a proper CDN that doesn't pretend to eat all the cost for a flat fee but then sometimes does not. BunnyCDN has an amazing volume tier at half a cent per GB.

buro9 4 years ago | | |

Oh exactly that.

Or if caching is your biggest priority then Fastly or Akamai will shine too.

But if you're balancing all considerations and want the cheap "good enough" caching with the DDoS protection, free TLS certs, and unmetered (assuming you aren't imgur or something)... then Cloudflare does a great job at being good enough. And for those sharp edges... drop in a proxy of your own, or layer your CDNs.

reitzensteinm 4 years ago | | |

Will BunnyCDN reliably keep an 18gb file in cache without hitting origin? I use and like Bunny, but relying on that to not get a massive bill in the mail scares the shit out of me.

jimbobimbo 4 years ago | | |

Azure has its own CDN. If one wants to do Cloudflare -> CDN -> Azure Storage, then at least let it be Azure CDN in the middle, not another cloud provider in the mix. ¯\_(ツ)_/¯

z3t4 4 years ago | | |

Or simply run everything on your own server. All those middlemen are going to kill any latency improvements you get from anycast edge servers.

martindbp 4 years ago | |

I've switched to Backblaze B2, which has a Bandwidth Alliance with Cloudflare. Even without it, B2 egress is something like 1/5th the price of S3, so it may be worth thinking about.

rawtxapp 4 years ago | |

If you use Argo caching on Cloudflare, it should reduce origin server load even more. Essentially, instead of going directly to your origin, the Cloudflare endpoint will first reach out to its root node to see if the file is cached there, and only that node is allowed to communicate with your origin. I see ~95% cache hits with that turned on.

bastawhiz 4 years ago | | |

Argo does not affect caching, only performance. You're maybe mistaking it for tiered caching or a custom caching topology.

XCSme 4 years ago | |

> Azure|Amazon > Hetzner|Linode > Cloudflare

Why not directly Hetzner|Linode > Cloudflare?

nightpool 4 years ago | | |

Because Hetzner and Linode VPSs have fixed disk sizes, while Azure and AWS have basically infinite storage. You use your cheap commodity VPS as a cache, not a source-of-truth.

nostrebored 4 years ago | | |

So that you incur as much downtime risk as possible, obviously.

I hate these 'cloud economics' optimizations that people tend to try.

zrail 4 years ago | |

Another option, if Linode's included bandwidth plus overages is too much, is a dedicated box from Reliable Site. I'm not a customer nor am I affiliated with them at all; I just occasionally check in on their low-end prices and noticed that they've started including an unmetered 1Gbps port with every host.

https://www.reliablesite.net

(search HN and reddit for that URL, you'll see they've been around and recommended for a really long time).

edub 4 years ago | |

If you're going to have an intermediary proxy that you run, for AWS perhaps use Lightsail. It is price competitive, and includes more bandwidth than Linode/DigitalOcean/Vultr for the price.

klohto 4 years ago | | |

You are not allowed to use Lightsail once you use more professional services on AWS, at least per the ToS.

ddlutz 4 years ago | |

Why not use the CDN of the cloud provider you are on? Azure Storage > Azure CDN

tills13 4 years ago | | |

Reducing CloudFlare to a CDN is a disservice. They have some amazing services like Bot Management and Workers that make them very appealing. The CDN is just a nice bonus.

pojzon 4 years ago | | |

Because it's an order of magnitude more expensive, like anything on the cloud really.

bastawhiz 4 years ago | | |

Azure CDN offers almost no discount on egress over Azure storage directly. The same is the case with Amazon's equivalent services.

canucker2016 4 years ago | |

Or Troy Hunt can ping his Cloudflare contacts and see if he can get access to Cloudflare R2 Storage.

see https://blog.cloudflare.com/introducing-r2-object-storage/

From the Cloudflare blog, it seems R2 would've handled this exact situation - auto-migration of cloud S3-like-storage objects - download from cloud-storage just once and cache in R2 for Cloudflare to serve.

lewisl9029 4 years ago | | |

Has anyone gotten access to R2 yet? I signed up but haven't heard back myself.

Would love to find out if you can write to any/every region and have things replicate, or if you have to write to a single region. BunnyCDN's edge storage solution looked interesting until I found out it only supported writes to a single region.

Hoping R2 might be my savior here, otherwise will probably have to roll my own active-active minio cluster, which I'm not looking forward to maintaining. Other suggestions welcome!

cuham_1754 4 years ago | |

How about Amazon Lightsail? Its price structure is basically the same as Hetzner's or Linode's, and you get it in-house if you use AWS.

manquer 4 years ago | | |

It is not the compute cost, it is the bandwidth cost. That is pretty much the same beyond the free tier within AWS.

jitbit 4 years ago | |

CloudFlare tiered cache is now free BTW

Dave3of5 4 years ago |

Ah the old cloud provider switcheroo. Yip this is the way they make money. They make it easy to set up some gigantic hugely scalable website then hit you with a gigantic scaled up bill. AWS would do this as well.

The team I'm in at the moment is in the early stages of cloud adoption, but the company as a whole has fallen hook, line and sinker for AWS. When I mention the cost there is always an excuse.

The main one being that you don't have to hire sysadmins anymore as that's taken care of now by AWS. Ah yes, but they have actually been replaced with a "DevOps" team, plus our department alone now spends > £1 million per year on AWS hosting costs. A 20% reduction in those fees could pay for a few sysadmins.

The next one is that no other vendor would be able to supply the kit. You know StackOverflow is able to run on a single webserver (https://nickcraver.com/blog/2016/02/17/stack-overflow-the-ar...). Plus many of the other providers have loads of instances available.

I mean I'm not against cloud, it's just not the cheapest option if you choose one of the big 3 providers. I use a company called Scaleway (https://www.scaleway.com/en/); they have all the essential cloud services you need and everything else you can run yourself in Docker or k8s.

cyberCleve 4 years ago |

Ouch. If Troy Hunt of all people can make this mistake, it can happen to anybody. HIBP is an awesome service funded totally by donations, so it's too bad this happened. Of course Microsoft is happy to hide behind their confusing pricing model and let customers overpay for Azure without alerting them.

oneepic 4 years ago |

It is worth mentioning that the alert itself costs money. So if you're evaluating the alert every 5 minutes on the past 24h of data it can burn a small but surprising amount of money.

From TFA it looks like that would be 10 cents per "time series". Or what I translate it to, is 10 cents every 5 minutes (*I think, but I haven't used Azure in some time*). $1.20/hour, $28.80/day, almost $900/month. Not too hard to drop that by making the alert less frequent. (edit: I think I saw AU$ there, so maybe it is AU$900.)

manarth 4 years ago | |

A time-series represents a "thing you're monitoring" – in this instance, it's aggregate egress, so $0.10 per month, regardless of the evaluation period.

Monitoring CPU? Another $0.10 per month. Memory? Another $0.10.

Thankfully, not $900.
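For anyone wanting to check the two readings side by side, here is the arithmetic as a small sketch. The $0.10 figure is the one quoted in this thread; which billing model Azure actually applies is not something this snippet can settle, and the count of three monitored series is just an example.

```python
from decimal import Decimal

PRICE = Decimal("0.10")       # figure quoted in the comments above
EVALUATIONS_PER_HOUR = 12     # one evaluation every 5 minutes

# Reading 1: the price is charged on every evaluation.
per_hour = PRICE * EVALUATIONS_PER_HOUR
per_day = per_hour * 24
per_month_if_per_evaluation = per_day * 30

# Reading 2: the price is charged once per monitored time series per month.
monitored_series = 3          # hypothetical: egress, CPU, memory
per_month_if_per_series = PRICE * monitored_series

print(per_hour, per_day, per_month_if_per_evaluation)  # 1.20 28.80 864.00
print(per_month_if_per_series)                         # 0.30
```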

oneepic 4 years ago | | |

I meant to emphasize frequency, not eval period. Apologies. That said I took a look at the pricing docs and didn't see frequency mentioned, so hopefully I am in the wrong about the price.

As an aside, their (Azure's) pricing docs are written in the same fishy way their technical docs are written (my opinion only)...

TriNetra 4 years ago | |

Shameless plug: https://CloudAlarm.in (in beta), sends you real alerts usually faster than azure with multiple reminders. It does this daily unless you tell it to shut up for the month for the given exceed. I call it real alerts because it doesn't wait for consumption threshold to reach the way Azure cost alerts do; as soon as it detects that your current cost * remaining days > the budget amount, it'll send you an alert [1].

The alert emails are way more meaningful (with the projected amount in the subject, for example) than the generic ones from Azure Alerts, so you see a real alert and are prompted to take immediate action.

1: https://cloudalarm.in/Home/Docs/#how-is-budget-alarm-differe...
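The general idea of alerting on a projection rather than on a consumed threshold can be sketched in a few lines. This is a plain linear extrapolation written for illustration; it is an assumption, not CloudAlarm's or Azure's actual algorithm, and the function names are invented.

```python
def projected_month_end_spend(spend_so_far, day_of_month, days_in_month):
    """Linear extrapolation: assume the average daily spend so far continues."""
    daily_rate = spend_so_far / day_of_month
    return daily_rate * days_in_month


def should_alert(spend_so_far, day_of_month, days_in_month, budget):
    """Alert as soon as the projected month-end spend exceeds the budget."""
    return projected_month_end_spend(spend_so_far, day_of_month, days_in_month) > budget


# $40 spent by day 5 of a 30-day month, against a $100 budget.
print(projected_month_end_spend(40, 5, 30))   # 240.0
print(should_alert(40, 5, 30, budget=100))    # True
print(40 >= 0.8 * 100)                        # False: an "80% consumed" alert would not have fired yet
```

Either approach is only as fresh as the billing data it reads, which is the concern raised in the reply below.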

GordonS 4 years ago | | |

But surely CloudAlarm relies on the same data as Azure's alerts do? Azure support told me that data is only updated daily.

Also, Azure has an option to alert you beforehand if it looks like you'll go over; struggling to see how your service is any better.

mnahkies 4 years ago | |

This is something to be mindful of when using Datadog synthetics monitors as well: if you have a short interval, or many locations being tested from, they can become expensive quickly.

godot 4 years ago |

These stories almost always boil down to this fundamental conflict of what you want for a personal project vs a business. (though in this case yes, Troy Hunt's HIBP is larger than a lot of startup businesses)

In a business setting, you want your service to stay up, at the cost of a spike in costs if accidents or mistakes happen.

In a personal project, you want there to be a hard limit on cost, and your service to go down if spikes call for it. (I'm relatively sure that no one wants their personal projects to incur a bill of thousands of dollars by accident.)

stevehind 4 years ago |

Have you contacted Azure? On one hand you owe the money “fair and square”, but on the other if I were them I’d waive an unexpected $10k bill to a good faith actor that was incurred without any proactive notification by Azure.

suction 4 years ago |

I wonder if, before cloud computing, there has ever been a successful product or service where it was accepted with just a shrug that the volatility of monthly costs could bankrupt you with next month's bill, because the complexity and opaqueness of the cost structure make it virtually impossible to predict and protect against extreme peaks in all parts of the setup.

Even if you run a relatively opaque cost structure business like a restaurant, you can still calculate the maximum cost of ingredients for one month, the salaries, energy, etc. if you simply use the "best case scenario" of having every seat at every table booked for all opening hours, with people ordering your most sold dishes. Cloud computing is leagues worse than that in terms of cost predictability.

I once worked for a small, non-startup software company that pondered moving servers to Azure. The Azure partner shop analysed the needs and came up with a monthly cost "between 30k and 120k per month". They were really surprised the company stuck with their non-cloud setup because "everybody is moving into the cloud!!"

usr1106 4 years ago |

That's the typical story. Something goes wrong and it costs you (typically a small company) a lot of money. At that moment nobody happens to be looking at the metrics. Even alarms are not a complete safeguard, because they can be missed too.

The only thing that would really help were a hard spending limit that stops all services except storage. If your site is important there will be such an amount of user feedback that it is impossible to miss it for a long time.

dspillett 4 years ago | |

Alerts can also fail to be timely due to mail/SMS/other delivery issues, or the right people being in the middle of something else. This delay means it is still possible to rack up an unexpected cost.

Or they can fail completely.

And the alerts themselves cost money if you want something reliable, so you have to weigh that against the danger. Pay-as-you-go cloud can be a maze of costing concerns.

> The only thing that would really help were a hard spending limit that stops all services except storage.

Yep. Though that is small comfort if you need to guarantee more than a couple of 9s of uptime; hopefully those with that requirement can soak up the unexpected billing blips.

alfiedotwtf 4 years ago | |

> The only thing that would really help were a hard spending limit that stops all services except storage.

Sadly, I haven't found a way to do that with AWS

dx034 4 years ago | | |

It's funny that even Hetzner can do that and AWS can't. Shows that there's no interest from AWS to prevent these things from happening.

UnFleshedOne 4 years ago | | |

I just looked at my AWS account and there seems to be a way to set a budget, attach alerts to it and attach actions to alerts. For example there is an action to stop EC2 instances. Not sure if other AWS services have something similar, but at least you can kill your instances if something weird happens.

Actions weren't there last time I checked (a few years ago).

Monotoko 4 years ago | | |

I believe kill switches in Lambda are possible, running when the alert is triggered.

mrb 4 years ago |

Most worrying is that even an expert like Troy Hunt was UNABLE to figure out the cause of the issue by himself. He "reached out to a friend at Cloudflare" who investigated and found the cause.

alkonaut 4 years ago |

Cloud providers should always have a max spend and it should be a standard feature. The cap shouldn't even be some optional feature or notification service. It should be a hard cap that you can move - at your own risk.

manquer 4 years ago | |

SMBs and indie developers are not the primary customers that Azure/AWS design their offerings for.

No enterprise wants limits based on spend; they would be a lot more pissed if service was pulled because a spending cap set by someone sometime in the past is now exceeded. That is likely why such a feature is optional, not mandatory.

Excess or unexpected billing would be negotiated in typical sales cycle discussions. Making a hard cap the default, however, would result in a lot of senior people getting midnight calls for emergency budget approvals, and management would get annoyed by that.

foobiekr 4 years ago | | |

I 100% work for a large enterprise and we would absolutely like spending policies in place. After all, we have fixed OPEX budgets planned in advance of the quarter.

kuu 4 years ago |

One thing I hate about the cloud providers is that there isn't an option to set a maximum cost. I would prefer to pull the plug on my side project rather than just receive an email telling me that the next bill is going to be over my budget. I understand not everyone would like to do that, but I would like to have that option.

defaultname 4 years ago | |

Oracle has fantastic budget tools. Not just "you've passed your budget", but "you're forecast to pass your budget in 22 days before the month is up". And you can couple it with quotas to create hard budgets.

AWS has decent tools in this regard, but they pale compared to Oracle's. Azure is a product I've never used with any scale (just small projects), but the fact that it actually costs money to set up alerts is gross (and morally reprehensible). Even if it's a trivial amount, that alone just sours the product in my eyes. I mean, already Azure is pretty uncompetitive unless you're running on free credits, as Troy apparently is (purportedly some $13K per year, so unsure what the pitch for donations to cover a bill is about).

schemescape 4 years ago | | |

This piqued my interest, but a few quick searches (using a search engine--the Oracle Cloud site search only turned up press releases...), indicate that quotas just prevent you from spinning up new instances. That's helpful, but I was hoping for some sort of way to cap my bill (for hobby projects), even if that requires deleting resources.

Oracle Cloud has an enticing free tier, but I'm too afraid to use it because it requires a credit card and I don't see any way to put a monthly cap on my budget. (I'm sure hobby projects with ~$5 - 10/month budgets aren't their target market, but I can dream :)

Edit to add the page I was reading: https://docs.oracle.com/en/cloud/get-started/subscriptions-c...

cma 4 years ago | |

They'd rather refund small guys for mistakes than give big guys an easy limit to set.

cdmckay 4 years ago |

It would be really classy if MS forgave that debt, especially considering the service is a public benefit.

kelsolaar 4 years ago | |

I would go as far as saying that the hosting for such a service should be entirely sponsored by Microsoft.

lodovic 4 years ago | | |

He's a "Microsoft Regional Director and MVP" so Microsoft pays the bill one way or another. I expect that he has reduced Azure rates as well.

anothernewdude 4 years ago | |

Would be even classier if the major cloud providers responded to customers calling out for budget limits for the past decade. Not many people want to risk potentially infinite costs.

scanr 4 years ago |

I wonder how much of the cloud provider revenue comes from situations like this. I suspect quite a lot.

I think that the cloud provider business model that allows for uncapped maximum costs is a bit of a commercial dark pattern. What makes it somewhat more nefarious is that it is relatively easy to blame the customer.

I’m not surprised that the cloud providers are quick to refund users as it’s likely that they only do it in a fraction of cases and it buys a lot of goodwill.

It would be interesting to try and design a cloud that supports OutOfMoneyException’s with gradual degradation and capped liability for costs built in.

Nextgrid 4 years ago | |

> I suspect quite a lot.

I don't actually believe so. Cloud providers are known to refund bills incurred by mistake. They make so much margin on legitimate usage by big companies & startups that it's just not worth burning developer goodwill & potentially wasting effort trying to collect a bill the customer legitimately can't pay (which would guarantee he never uses nor advocates for your service again).

throwawayffffas 4 years ago |

Question, is the 0.014AUD per GB quoted here correct? Looking at the linked page[1] I would think the cost would be 0.1102AUD per GB as is quoted in the Internet egress section.

https://azure.microsoft.com/en-au/pricing/details/bandwidth/

throwawayffffas 4 years ago | |

Also (3200 GB per day * 30 days) * 0.014 AUD per GB is 1344 AUD. While (3200 GB per day * 30 days * 0.1102 AUD per GB) is 10579.2 AUD much closer to the final bill.

My conclusion: Troy still doesn't know how much he is paying.
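The two totals are easy to reproduce. This sketch only uses the figures quoted in this comment (3200 GB per day, 30 days, and the two per-GB rates in AUD); `Decimal` is used to avoid floating point noise.

```python
from decimal import Decimal

GB_PER_DAY = 3200
DAYS = 30
monthly_gb = GB_PER_DAY * DAYS  # 96000 GB

cost_at_quoted_rate = monthly_gb * Decimal("0.014")            # rate quoted in the article
cost_at_internet_egress_rate = monthly_gb * Decimal("0.1102")  # rate from the pricing page's Internet egress section

print(cost_at_quoted_rate)            # 1344.000
print(cost_at_internet_egress_rate)   # 10579.2000
```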

CodesInChaos 4 years ago | |

It clearly isn't. It looks like he is confusing transfers between availability zones in one region with egress to the internet. A factor-of-10 mistake like that should be obvious, but he didn't fix it, even after I pointed it out in the comments on his blog (he responded that the price for me might be different due to region/currency settings).

zzt123 4 years ago |

Interestingly, Troy says that egress is expensive on Azure at $0.014 AUD/GB (~$0.010 USD/GB), but that is the same price as additional egress for Linode and DO, and Linode egress has never struck me as expensive. In fact, I’m kind of shocked (as an AWS user) that Azure egress is the same price as Linode.

Actually, wow it seems AWS is also the same price as Linode and DO for egress. While Linodes and DO do come with decent free bandwidth, this is a surprise to me.

coder543 4 years ago | |

You’ve interpreted the numbers wrong. Yes, Linode, DigitalOcean, and most of this class of providers charge $0.01/GB. Almost literally an order of magnitude less than Azure or AWS. The megaclouds massively overcharge for bandwidth. It’s not even close.

AWS charges $0.09/GB, and Azure charges $0.0875/GB.

Maybe Troy Hunt gets a discount for being a Microsoft Regional Director and MVP. (Neither of which make him an employee of Microsoft, confusingly enough.)

https://docs.digitalocean.com/products/billing/bandwidth/

https://www.linode.com/docs/guides/network-transfer/

https://aws.amazon.com/ec2/pricing/on-demand/

https://azure.microsoft.com/en-us/pricing/details/bandwidth/

bluedino 4 years ago |

Reminds me of a time, we had a new site that was going to run on GCP, we had been using a couple co-located servers for years.

When everything was moved to production and the URL went live, nobody ever did any kind of bandwidth checking, caching, no CDN, no cost tracking. $10,000 in our first week. That's about 1/4 of what our total spend on the co-located servers was for the whole year. Boss flipped his lid and wanted to kill the new guy who was on the project.

After about 2 years we got rid of all the co-located stuff and were spending about 1.5x, but we had more apps, they served heavier pages, etc.

dijit 4 years ago | |

1.5x is pretty good.

We overspent quite heavily on our on-prem stuff for a game I helped launch, for political reasons the next game ended up running on the cloud.

The price was roughly 10x before discounts. With our heavy discounts and a wide amount of slimming down/cost optimisation (easily 3 months of work) we got it to 2.3x

There will always be a need for sysadmins/cloudops/devops for that environment, so we didn't save any headcount either.

I can't imagine getting anywhere close to parity in costs. Functions-as-a-service ended up costing more than compute instances too, so we went back to compute instances in places where we thought we'd get away from them.

That said, it was a lot nicer to use!

hogrider 4 years ago | |

Awful toxic boss.

hdjjhhvvhga 4 years ago |

It is very good these things are getting publicized. More and more people recognize these payment schemes for what they are: a scam. Every cloud provider that refuses to put in a hard spending limit participates in this.

It is important to remember that not all cloud providers participate in it. For example, in Hetzner Cloud, they explicitly provide the maximum amount you are going to pay for a given instance or service in a given month. You are guaranteed not to pay more. Everybody knows why Amazon etc. refuses to do it this way.

zekica 4 years ago | |

On Hetzner, with their €1.00 per TB after the included 20TB, you can pay up to €324 per VPS if you fully saturate the link all month, as you are limited to 1Gbps.
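A quick sanity check of that ceiling, as a sketch. The assumptions are mine, not Hetzner's: decimal units (1 TB = 10**12 bytes), a 30-day month, and a link saturated for every second of it. Under those assumptions 324 is the total number of TB moved; if the first 20 TB are indeed included, the overage would come out nearer €304.

```python
# Assumptions: decimal units, a 30-day month, link saturated the whole time.
LINK_BITS_PER_SECOND = 1_000_000_000    # 1 Gbps port
SECONDS_PER_MONTH = 30 * 24 * 60 * 60
INCLUDED_TB = 20                        # included traffic, per the comment
EUR_PER_EXTRA_TB = 1                    # overage price, per the comment

total_bytes = LINK_BITS_PER_SECOND // 8 * SECONDS_PER_MONTH
total_tb = total_bytes // 10**12
overage_eur = max(0, total_tb - INCLUDED_TB) * EUR_PER_EXTRA_TB

print(total_tb)      # 324
print(overage_eur)   # 304
```

Either way the worst case is a few hundred euros, and it is known in advance.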

dx034 4 years ago | | |

I doubt you'll manage to get the exact 1Gbps per VPS out all month. On dedicated that's more likely. But luckily they have a very easy setting for billing alerts and maximum in the settings page.

mawalu 4 years ago | |

Hetzner Cloud(!) only has 20TB/month included in the monthly costs and states that you have to pay for any additional traffic. I never reached that on one of their cloud boxes so I don't know what it looks like, but it definitely isn't all up front. But yes, the dedicated machines come with no additional traffic charges whatsoever.

CodesInChaos 4 years ago | | |

Additional traffic costs 1 EUR/TB (plus VAT, depending on where you live). So it's about 50 times cheaper than the big clouds.

Seattle3503 4 years ago |

My (naive) solution. Every new account by default has an SMS alert that trips at $100. It says

"Your account has exceeded $100 spend. Reply 'SHUTDOWN' to shut down all services, 'STOP ALERTS' to never see this alert again, or 'DOUBLE TRIGGER' to double the alert trigger value to $200."

$100 is arbitrary, it could be any nominal sum. The idea being that the user can double the alert each time they get it just from SMS. I bet 95% of users would double their alert limit to a comfortable point. The other ~5% will be power users who customize their alerts.

The idea that these companies couldn't know what limits customers want is kinda silly. We can use the same techniques for alerts that we use in algorithms for expanding vector storage, for example. We can "amortize" alerts, so to speak.
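The vector-growth analogy can be sketched like this: if the threshold doubles every time it trips, the number of alerts grows with the logarithm of the spend instead of linearly. The function name and the $10,000 example are made up for illustration.

```python
def alerts_with_doubling(spend, first_threshold=100):
    """Count alerts fired if the threshold doubles each time spend exceeds it.

    Returns (number_of_alerts, threshold_now_in_force).
    """
    count = 0
    threshold = first_threshold
    while spend > threshold:
        count += 1
        threshold *= 2
    return count, threshold


spend = 10_000
print(alerts_with_doubling(spend))   # (7, 12800): alerts at 100, 200, 400, ..., 6400
print(spend // 100)                  # 100 alerts if the threshold only moved up $100 at a time
```

So a user who replies 'DOUBLE TRIGGER' every time gets seven texts on the way to a $10k bill, which seems tolerable.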

Nextgrid 4 years ago | |

The problem is that metering these services at such granularity is difficult: https://news.ycombinator.com/item?id=30066538

Seattle3503 4 years ago | | |

It doesn't need to be very accurate. As long as the values are the same order of magnitude it is probably okay.

llampx 4 years ago |

Very nice writeup, thanks to the author for writing it so clearly for someone who is not familiar with the nitty-gritty to be able to follow it.

kidsil 4 years ago |

Shameless plug - the core of my work is about ensuring these unexpected costs never happen.

We have some recent case studies where we've successfully reduced cloud costs by 95%

https://www.cloudexpat.com/case-studies/

hi(at)cloudexpat.com - happy to help!

Nextgrid 4 years ago | |

Out of curiosity, do you merely optimize existing cloud usage or do you help your clients move to hybrid/bare-metal?

knorker 4 years ago |

As soon as I saw "17GB file" i thought "that's what torrents are for". Otherwise one mistake and... Well this happens.

Or someone maliciously bypasses CF cache e.g. by parameters.

Cloud just is not suitable for any kind of volume egress. It's a death trap. Like going on vacation with data roaming enabled.
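To illustrate the cache bypass by parameters: if the cache key includes the query string, every request with a fresh throwaway parameter looks like a new object and goes to origin. The sketch below is generic; `cache_key` and the `nocache` parameter are hypothetical, and how any particular CDN builds its keys depends on its configuration.

```python
from urllib.parse import urlsplit, urlunsplit


def cache_key(url, ignore_query):
    """Build a cache key from a URL, optionally dropping the query string."""
    parts = urlsplit(url)
    query = "" if ignore_query else parts.query
    return urlunsplit((parts.scheme, parts.netloc, parts.path, query, ""))


# Three requests for the same file, each with a different throwaway parameter.
requests = [f"https://example.com/big-file.zip?nocache={i}" for i in range(3)]

keys_with_query = {cache_key(u, ignore_query=False) for u in requests}
keys_without_query = {cache_key(u, ignore_query=True) for u in requests}

print(len(keys_with_query))     # 3: each request is a separate cache entry, so 3 origin fetches
print(len(keys_without_query))  # 1: all three map to the same cached object
```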

Aissen 4 years ago | |

Yeah, HIBP is using torrents:

> I removed the direct download links from the HIBP website and just left the torrents which had plenty of seeds so it was still easy to get the data. Since then, Cloudflare upped that 15GB limit and I've restored the links for folks that aren't in a position to pull down a torrent. Crisis over.

knorker 4 years ago | | |

I know, I read the article.

But I feel like Dr Strangelove here. Of course, the whole point of a torrent on a cloud service is lost if you also provide a raw download link.

Also providing a download link is tempting, but can easily cost (for a 17GB file and growing) up to US $3 per click.

Even off of their premium global network it's over $2 per click. The cheapest in Microsoft's entire egress table would be $0.68 per click (but that only kicks in after you've spent way more than $9400 in cheaper tiers in a given month).

Egress kills you, in cloud. "Oh, cloudflare probably caches most of this" is not something I'd recommend.

dx034 4 years ago | | |

And then Cloudflare will not cache it at some locations for random reasons and the cloud bill is back. Anyone with technical knowledge should have no problem routing static files via machines at OVH/Hetzner and the like, no reason to enter such risks for maybe an hour of setup time saved.

dx034 4 years ago | |

Or use the Hetzner server auction to get a cheap 20/30€ machine with unlimited traffic at 1Gbps. Setup time is max 1h even if you do it manually; with Cloudflare Tunnel it's also really easy to lock down everything with a firewall and have minimal exposure to threats.

faebi 4 years ago |

I have 10 Gbit/s internet at home. Sometimes I wonder how many services/people I could bankrupt by using it harder. Not that I want this, but more like, why is it even possible?

ccbccccbbcccbb 4 years ago |

> I have been, and still remain, a massive proponent of "the cloud".

Mice cried and stung themselves, but kept eating the cactus.

sudhirj 4 years ago |

This particular problem basically boils down to "CDN providers don't like caching large files", which is a very common problem. Everything else was configured and set up exactly right to not have a large bill.

Most CDN providers have a lot of machines out on the edges of their networks, and it's understandable that they don't stuff these machines with large disks, likely preferring smaller faster SSDs. But this is a very common pitfall of CDNs that needs more attention, along with messaging on the dashboards and settings pages.

I've had problems with no warning on Cloudfront, Cloudflare, Bunny.net all from not realising that my files were beyond the CDN's cache size limit, but none of them seem to do a good job at surfacing this other than "talk to customer support".

Cloudfront does list the max size clearly in the limits and quotas page, though, and if you front your S3 bucket with Cloudfront, you could turn caching off and still get the discounted bandwidth out rates (S3 -> Cloudfront is always free, even if the file is fetched every time).

jrochkind1 4 years ago | |

Cloudfront isn't much discounted bandwidth out compared to S3 though, is it?

I see S3 is initial $0.09/GB, going down to $0.07 after 50TB or $0.05 after 150TB.

Cloudfront North America is $0.085 for the first 10TB, but $0.110 and up for other regions, going down to $0.060 in North America after 100TB, and okay, $0.025 after 1PB (but $0.050 and up in other regions even after 1PB).

So okay, Cloudfront gets cheaper egress at large scale, I guess. By about 50% though, not an order of magnitude, and could be much less depending on region.

sudhirj 4 years ago | | |

The reserved capacity pricing is lower; in a business setting your account manager will usually suggest this pretty quickly if you have a steady and/or increasing Cloudfront bill.

2ion 4 years ago |

This is why I use fixed price offerings for personal projects.

A large bill is probably chump change for someone like Troy, for others it's a year or two of savings. The risk is not worth it.

schemescape 4 years ago | |

Would you mind sharing the services you’ve found that have fixed prices? I haven’t had much luck finding services like that (although I’m looking in the < $20/month range).

manquer 4 years ago | | |

For fixed price and fixed performance you can use bare metal providers with unmetered bandwidth; generally tier 2 vendors offer that.

At $20 bare metal is not easily possible; the lowest prices I have seen are usually $40-50 and above. However, you can get a VPS with unmetered bandwidth and no other costs in your price range [1]. The price is still fixed, though there may be some performance variance; at $20 minor variances are unavoidable.

[1] https://us.ovhcloud.com/vps/compare/

ksec 4 years ago |

>What we're talking about here is egress bandwidth for data being sent out of Microsoft's Azure infrastructure (priced at AU$0.014 per GB).

AUD $0.014 is roughly USD $0.01. Which I thought was reasonable. But on [1] only "Data transfer between Availability Zones (Egress and Ingress)" costs $0.01. Does transferring from Azure to CF count as that? Other Internet egress (routed via Routing preference transit ISP network) starts at $0.08.

I hope someone from Azure CS could give him a custom discount.

It is also worth considering that the costs HIBP saved on cloud/serverless over the years could have been wiped out (if not more) by this single incident.

[1] https://azure.microsoft.com/en-au/pricing/details/bandwidth/...

nbevans 4 years ago | |

Cloudflare and Azure have a "Bandwidth Alliance" peering which - if you correctly set up your Azure resources to use "Internet Routing" - will result in a modest discount. It is a bit of a scam though as it is marketed as though you'll get 100% discount but in reality it is more like 15% off. I think GCP is 100% though.

gcbirzan 4 years ago | | |

Definitely not 100%, more like 66% off: https://cloud.google.com/network-connectivity/docs/cdn-inter...

hkh 4 years ago |

We've been thinking about this for a while, and if there is any way we can catch these types of cost spikes before they happen. We've managed to do it for Terraform resources using an estimation approach, and using a usage file, you can model expected usage-based resources (https://github.com/infracost/infracost/blob/master/infracost...), but this one has got us thinking more about policies.

To be clear - we would not have been able to catch this one right now :'(

Would love to hear thoughts / brainstorm ideas - is there any way we can proactively catch these types of cost spikes?

Olreich 4 years ago | |

I think this is fundamental to on-demand services. Anything outside terraform or another configuration file system is hard to reason about. If cloudflare is in your config system, then you could put up a warning that files bigger than whatever won’t get cached, but that still assumes a level of knowledge about the system that you don’t generally have.

Setting up limits and alerts as part of the system creation is usually the best strategy.

hkh 4 years ago | | |

I like that, maybe we have to build up a knowledge base of wisdom (probably learnt the hard way), and warn if the conditions are met, or at least provide a list of things to note. The cloud cost alert would then be a fallback safety net.

nbevans 4 years ago |

One wonders how Cloudflare can essentially absorb all bandwidth costs. But AWS and Azure are using them as a profit center.

uncertainrhymes 4 years ago | |

On the cloud providers, you are paying for your usage (yes, marked up, but they have costs too).

Cloudflare has the same model, but they distribute the costs. The vast majority of people never use anywhere close to their share, so they subsidize the outliers and the free tier.

tyingq 4 years ago | |

Lots of peering. They pay $0 for roughly half of their egress.

https://blog.cloudflare.com/the-relative-cost-of-bandwidth-a...

OtomotO 4 years ago |

Well, the cloud is just a convenient way of accessing someone else's server.

Convenience always costs money, there is no (big) cloud provider doing it out of their own pocket or rather not optimizing for huge profits.

It's the same as with any other service, really. So I don't understand, why some people assume it would be different here.

(Note: I am not saying that Troy Hunt assumed this, but I know people who go to the cloud because "It's cheaper". It was never cheaper, on any project I worked on. It was more convenient, but in the end it was mostly more expensive)

DigitalSea 4 years ago |

I would be surprised if Azure doesn't waive or reduce this bill dramatically. Something similar happened to me with AWS. I had a simple file upload service where files would expire if they hadn't been accessed in 24 hours. Someone started using it to upload music and videos. I ended up with a high bandwidth bill on Amazon S3. I reached out and explained what happened, they waived the costs entirely (to the tune of $5000).

Abishek_Muthian 4 years ago |

Valuable investigation steps to find the erring cloud resource, but as Troy concludes, 'Budget Alerts' would have saved him from this issue.

No matter what the traffic is, the first thing to do with any cloud service provider is to set the budget alerts according to our wallet, be it one with credits or otherwise. At this point, I don't even try any new cloud service provider that doesn't offer credible budget alerts.

Another key takeaway is,

> Huh, no "CacheControl" value. But there wasn't one on any of the previous zip files either and the Cloudflare page rule above should be overriding anything here by virtue of the edge cache TTL setting anyway.

Even this could blow up. All cloud service providers set the "CacheControl" to "No" and if we would want to cache something which is not cached by CF by default e.g. *html using Page Rules then we need to set CacheControl (e.g. max-age) at the cloud service provider end too.

P.S. I've written about these recently on my blog titled 'Saving Cloud Costs'[1] from a frugal solopreneur PoV.

[1] https://hitstartup.com/saving-cloud-costs/

emptybottle 4 years ago |

This is why I personally won't run projects on infrastructure with what roughly equates to unlimited risk billing.

It's my opinion that it's better to work with known limitations and optimize for them.

In the case of bandwidth, work with a fixed pipe size, or do the math and set up a QoS that implements a throttle to avoid exceeding your bandwidth allotment.
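The "do the math" step above can be sketched roughly as follows. This is a minimal illustration, not a QoS configuration: the 20 TB allotment and 30-day month are assumed numbers, and `sustained_rate_mbit` is a hypothetical helper name. It only computes the average rate that would exactly use up a transfer allotment, which is the ceiling a throttle would need to stay under.

```python
def sustained_rate_mbit(allotment_tb: float, days: int = 30) -> float:
    """Average rate in Mbit/s that consumes `allotment_tb` (decimal TB) in `days` days."""
    bits = allotment_tb * 1e12 * 8      # TB -> bytes -> bits
    seconds = days * 86_400             # seconds in the billing period
    return bits / seconds / 1e6         # bits/s -> Mbit/s


# Assumed example plan: 20 TB of transfer per 30-day month.
limit = sustained_rate_mbit(20)
print(f"throttle at or below {limit:.1f} Mbit/s")
```

A throttle set at or below that rate cannot exceed the allotment over the period, at the price of slower downloads during bursts.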

jskrablin 4 years ago |

First thing one should always set on any cloud account is billing alerts. Set > 1 and set first to ~ 80% of what you think will be your normal cost then add extra alerts all the way up to 100%. That way you'll usually get an early warning with some time to act before it becomes really expensive.
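As a rough sketch of the tiering described above (first alert around 80% of the expected cost, more alerts up to 100%), the amounts could be derived like this. The function name, the 90% middle step, and the $50 example budget are all assumptions for illustration; the actual alerts still have to be created in whatever billing console or API the provider offers.

```python
def alert_thresholds(expected_monthly_cost: float,
                     percents: tuple[int, ...] = (80, 90, 100)) -> list[float]:
    """Return alert amounts as percentages of the expected monthly cost."""
    return [round(expected_monthly_cost * p / 100, 2) for p in percents]


# Assumed example: a project expected to cost about $50 per month.
print(alert_thresholds(50.0))
```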

pontifier 4 years ago |

Everything can be going fine for a long time, and then cloud costs kill your business.

This happened to Murfie a couple of years ago, and that's why I had to step in to try to fix things. I'm still trying, and there are still challenges, but I won't allow landlords and cloud costs to disrupt things again.

mathattack 4 years ago |

Think about how many big companies struggle with this. Most don’t have one person who can think through the cost of the cloud, as well as the activities to manage the costs. Many even say “Let engineers be engineers, and business people own the costs.” And all of a sudden you get a ton of surprises…

dtx1 4 years ago |

If Microsoft doesn't show the decency to forgive that bill, i'd be happy to chip in!

fleddr 4 years ago |

Cloud providers should really start protecting customers from these spikes. Alerts are not enough, there should also be hard caps (stop serving) and soft caps (serve at reduced speed/capacity) based on configured max budgets.
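To make the hard cap / soft cap idea concrete, here is a minimal sketch of the decision such a feature would make. It is purely hypothetical: no provider API is being described, and `Action`, `cap_action`, and the example amounts are made-up names and numbers.

```python
from enum import Enum


class Action(Enum):
    SERVE = "serve"          # under both caps: serve normally
    THROTTLE = "throttle"    # soft cap reached: serve at reduced speed/capacity
    STOP = "stop"            # hard cap reached: stop serving


def cap_action(spend: float, soft_cap: float, hard_cap: float) -> Action:
    """Pick an action from the current period's spend and the configured caps."""
    if soft_cap > hard_cap:
        raise ValueError("soft cap must not exceed hard cap")
    if spend >= hard_cap:
        return Action.STOP
    if spend >= soft_cap:
        return Action.THROTTLE
    return Action.SERVE


# Assumed budget: throttle from $100, stop at $200.
for spend in (20, 150, 250):
    print(spend, cap_action(spend, soft_cap=100, hard_cap=200).value)
```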

intricatedetail 4 years ago |

If you are not a VC backed corporation you must be insane to run anything on a "cloud". Why not rent a dedicated server from OVH or others where you can actually control costs and pay 10-100 times less?

Nextgrid 4 years ago | |

Because experience getting shit done using boring tools doesn't translate well to a future career in a VC-backed company wrangling Terraform & YAML files.

bawolff 4 years ago |

Seems at least a little unethical that cloud companies do pay as you go up to infinity, instead of some model where you transfer money in and if you use it all up your service gets cut.

XorNot 4 years ago | |

There'd be value in a model which allowed you to pay up to some limit then switch into a user-pays model if the user wanted the service right now.

polote 4 years ago |

As someone who spent a few hours getting cf to successfully cache b2 files, I'm curious what share of Cloudflare's support requests are due to caching issues.

It's time for cf to work a bit on its UX

sergiotapia 4 years ago |

>This was about AU$350 a day for a month. It really hurt, and it shouldn't have happened. I should have picked up on it earlier and had safeguards in place to ensure it didn't happen. It's on me.

Uh no - it's on cloudflare and azure. Why don't they have a global setting that says Max Charges Per Month: $X and it just shuts down when it hits that number? This is why I don't really like using big cloud services like this.

rcarmo 4 years ago |

This prompted me to go and check my custom static site generator (which renders my blog onto an Azure storage account exposed via HTTP and Cloudflare).

Turns out I wasn't setting x-ms-cache-control when writing all the blobs, so that's a win right there.

(interestingly, it appears that rclone, which I was in the process of moving to, doesn't do that, so I might have to keep my custom Azure storage library around)

mro_name 4 years ago |

Shouldn't lookups be where cdb shines? Hold my beer:

  $ shard="$(echo "${sha1}" | cut -c 1)"
  $ cdb -q pwned-passwords-v8-sha1-${shard}.cdb "${sha1}"

But as a cloud evangelist at Microsoft, you may sing the corporate IT gospel anyway. ¹https://mro.name/agakdfa

BonoboIO 4 years ago |

Well ... it's not like it was the first time this happened to a software developer.

He should have known better that there is a risk, that you don't know some detail that costs you a lot of money.

Cloud Bandwidth is soooooooooo expensive. If there is a risk that you have to pay this, please use a provider like Hetzner with fixed costs. If you like your serverless things, just host the big files at Hetzner.

commandlinefan 4 years ago |

> I always knew bandwidth on Azure was expensive and I should have been monitoring it better

It's suspicious that cloud providers STILL don't have any sort of "circuit breaker" infrastructure for this sort of thing - yes, you can set up alerts, but you can't say, "shut the whole thing down before the costs go above a certain threshold".

rkwasny 4 years ago |

I guess all Microsoft PR and Marketing departments are now on the phone trying to get this guy a refund and take down this post :)

throwawayffffas 4 years ago | |

This guy is a Microsoft Regional Director he is part of the Microsoft PR engine.

jve 4 years ago |

> I, uh, have a bill I need to pay

Kind of sad that a service we are accustomed to using, and that various software integrates (whether using the HIBP API or the downloaded pwned passwords archive), rests on the shoulders of a single guy who now has to pay for his mistake.

Great that Cloudflare helps him with the service, otherwise who knows if we would have access to HIBP at this scale?

razzio 4 years ago |

Hope it is okay and not too much off-topic. I just donated. He deserves it for this service!

Fact is that stuff like this can happen. Considering how many variables are in play to determine the final cost of a cloud service, it is very much a double-edged sword. Sometimes you cut yourself unintentionally.

So now we all learn from this, I suggest we help him out.

queuebert 4 years ago |

Looking forward to the followup post in early 2033 when he forgets to extend the cost alert expiration.

lysecret 4 years ago |

This is a big trap to fall into. I don't understand why network traffic is so expensive in AWS too. I once had a 2k monthly bill purely from networking because I accidentally routed a lot of requests through a NAT. That hurt haha. Now I stay away from those things :D

jrochkind1 4 years ago |

> But these would always cache at the Cloudflare edge node, that's why I could provide the service for free, and I'd done a bunch of work with the folks there to make sure the bandwidth from the origin service was negligible.

If you're not Troy Hunt or another celebrity with special access to Cloudflare -- I don't think you really have access to Cloudflare to do a lot of work with you to ensure that your data gets cached and your egress is minimal, for large files on a very cheap cloudflare plan. (Based on the costs reported by Hunt as catastrophic, I don't think he's paying cloudflare for a large enterprise plan)

(Also, it's unclear if caching large data like this is even within the ToS of Cloudflare?)

I don't think Cloudflare promises to cache any particular URLs for any particular amounts of time (except no greater than cache headers etc; but they don't promise never to evict from cache sooner; they evict LRU according to their own policies). Cloudflare's marketed purposes include globally distributed performance, and security. I don't think they include "saving egress charges by long-term caching your data".

I have a much smaller project, but egress charges for data are an increasingly large part of my budget. I've been trying to figure out what if anything can be done about it. I wish I had a guaranteed way to get ultra-long-cache promise-to-be-within-ToS for very large data files from Cloudflare for an affordable fixed-rate price. (Maybe I do? But just haven't reassured myself of it yet?)

> In desperation, I reached out to a friend at Cloudflare… I recalled a discussion years earlier where Cloudflare had upped the cacheable size… Since then, Cloudflare upped that 15GB limit…

Since I'm looking for solutions for this same problem (delivering lots of data at very cheap prices), I am finding myself a bit annoyed that Hunt is talking about how he solved it, using tools/price-levels not available to most of us who don't have his level of access due to position.

Interestingly, MSN/Azure is part of the "Bandwidth Alliance" with cloudflare, which initially one thinks means there are no egress charges when delivering to cloudflare. (That is what it means for some other alliance members like backblaze). But that's clearly not the case or this story wouldn't happen, right? Turns out Azure gives you a fairly small egress discount when delivering to cloudflare, and only if you set things up in a non-standard way.

superphil0 4 years ago |

First thing i do is set an alert when costs go over 10$ for any new project. Highly recommend

onion2k 4 years ago | |

Do you also make sure you never go on vacation, never go anywhere that doesn't have a phone signal, never turn off your phone, that your alerts have multiple levels of redundancy, and that you always have access to a computer to modify settings?

progx 4 years ago |

Clouds are good for quick start and fast grow. But after this phase, you should think about "classic" hosting solutions (multiserver, load balancer, etc.), they could be much cheaper.

as long as your human admin costs are lower than cloud services

cgtyoder 4 years ago |

It's unconscionable that MS doesn't have warning notifications in place BY DEFAULT, so when you start incurring charges e.g. 10x of normal, you get notified immediately. One shouldn't have to set these up manually ever.

philliphaydon 4 years ago |

It seems like everyone is blaming azure when this was an issue with CloudFlare…

I get that everyone has an obsession with dirt cheap providers instead of cloud solutions like aws/azure. But that doesn’t mean it’s better. Everything has pros and cons.

lkxijlewlf 4 years ago |

I'm sure some cloud providers have it, but they all should have a global, "If my account hits $XXX shut it all down immediately and email me" flag. And yes, that's kind of what he did here, I get that.

hogrider 4 years ago |

I wonder if people will start to make shell companies to just go bankrupt when this happens and start afresh with another company. The cloud vendor doesn't look too closely at what you are running, right? So this could work.

pibefision 4 years ago |

Most of the clouds have functionalities to manage this. In AWS for example you can create an alarm with AWS Budget to monitor costs by tools/service/etc. Using a complex cloud without using this is not good practice.

taubek 4 years ago |

It is a good thing to know that this could happen to anyone. I guess that setting limits and alerts should be one of the first things that one should do.

What would happen if a credit card limit was exceeded, a site would just stop working?

therealbilly 4 years ago |

Yeah the problem with Cloud vendors is that if they make a mistake, it will usually disadvantage the customer...not them. I'm a little biased as I don't completely buy into the whole Cloud paradigm.

csours 4 years ago |

Cloud seems like a pet tiger - really cool and fun, until it turns on you.

pdimitar 4 years ago |

Enjoyed the article.

But still, I couldn't help getting the following lasting impression after reading it: these days being able to click around the UIs of the cloud providers should be a billable skill by itself.

floor_ 4 years ago |

This guy needs to clean up his bio. There seems to be a lot of confusion on whether or not he works for Microsoft when it appears that he is a uhh... reverse pay midlevel manager intern?

Havoc 4 years ago |

These things really should have an AI-like alert that is basically “cost is departing dramatically from historical pattern” without the need to set thresholds and the like
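A very plain, non-AI version of "cost is departing dramatically from historical pattern" could look like the sketch below. It compares today's cost with the median of recent daily costs instead of a fixed dollar threshold. The function name, the factor of 3, the 7-day minimum, and the sample numbers are all assumptions, and a real system would also need to deal with seasonality and legitimate growth.

```python
from statistics import median


def is_cost_anomaly(history: list[float], today: float,
                    factor: float = 3.0, min_days: int = 7) -> bool:
    """True when today's cost exceeds `factor` times the median of past daily costs."""
    if len(history) < min_days:
        return False                 # too little data to call anything abnormal
    baseline = median(history)       # median is robust to a single earlier spike
    return today > factor * baseline


# Assumed daily costs for the past week, then a normal day and a spike.
past_week = [2.0, 2.5, 3.0, 2.0, 2.2, 3.1, 2.4]
print(is_cost_anomaly(past_week, 4.0))
print(is_cost_anomaly(past_week, 350.0))
```

Note that with a baseline of zero any positive cost is flagged, which may or may not be what you want for a brand-new project.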

TacticalCoder 4 years ago |

Are there cloud services that allow you to easily set a maximum budget, to make sure you have no surprise costs like that?

napolux 4 years ago | |

In my experience you can only set up billing alerts, which are fair, if you ask me.

I took a good course on pluralsight about AWS and the first lesson was to set up a billing alert.

What will hard limits do to your infra? You can't take down / suspend DBs, EC2s, etc... Just because you set a 1k USD limit and that's it.

Alerts are the 1st thing you should set up IMHO

notreallyserio 4 years ago | | |

> You can't take down / suspend DBs, EC2s, etc... Just because you set a 1k USD limit and that's it.

You (the cloud provider) can shut down VMs, block access to all services, and just retain the content in storage until the bill is resolved or the account is permanently closed. The cost would be trivial as storage is dirt cheap.

snovv_crash 4 years ago | |

Google App Engine allows you to set up hard spending caps, after which your application will start returning 503s

unixhero 4 years ago |

It would be good if he contacts Microsoft about this. Sometimes they will give credits for situations such as this.

goodguyamericun 4 years ago | |

He is Troy Hunt and an MS MVP; as soon as MS gets wind, they'd be the ones to contact him

3pt14159 4 years ago |

Happily donated to Troy. He's done more than most to help everyday folks weather these data breaches.

dx034 4 years ago | |

My issue with this is that the donation is basically to Microsoft for their dark patterns. There's no way this traffic cost much to Microsoft, so it all is added profit for their shareholders. Other providers would've provided the same service and bandwidth for a much lower price.

I really appreciate the work that Troy is doing, but seeing much needed money ending up at Microsoft or Amazon leaves a bitter taste. I hope at some point it will become cool again to just rent a VM or dedicated server for small projects and stop throwing so much money at the already richest people in the world.

jimmydorry 4 years ago | | |

Unfortunately, data in Aus really costs this much (more actually), from my experience colocating in a few data centres (I was typically paying $0.3/GB). It’s certainly possible it cost them less, but very doubtful on it being close to free.

EDIT: Apparently it was hosted out of US West, so I agree that the actual data cost would probably be a lot less.

YetAnotherNick 4 years ago |

I don't understand it. Does a cloudflare edge server sit inside Azure?

mstrem 4 years ago | |

No. Cloudflare is configured as a reverse proxy in front of the site. So traffic reaches the Cloudflare edge first, then it is proxied to the origin on Azure unless the file is served directly from the Cloudflare cache.

rob_c 4 years ago |

close account, cancel card and move on with life before they charge you.

lom 4 years ago |

If anything, this shows the insane scalability of the cloud

joking 4 years ago |

outbound transfer cost is one of the most expensive things in cloud computing, it's much better when you can pay for allocated bandwidth.

Mave83 4 years ago |

Just avoid cloud and choose dedicated infrastructure

_8j50 4 years ago |

Didn't Troy sell HIBP to Verizon?

joantune 4 years ago |

Donated! Hope it helps

parentheses 4 years ago |

TL;DR: I got a big bill from my cloud provider, so I used more cloud provider features, to make sure I know before I get the bill; isn't my cloud provider great?

lpcvoid 4 years ago |

Can somebody explain to me why I wouldn't just rent a 40 EUR dedicated server from Hetzner with unlimited traffic and gigabit uplink? His 600GB/day is way less than what you get over a gigabit link within a day. Sure, sudden bursts would perhaps "throttle" at a gigabit, but according to his article that was only the cloudflare proxy anyhow, so no pain in having that take a few seconds longer.

As far as I am concerned, I just don't understand why people use cloud services.

technion 4 years ago | |

He is a Microsoft MVP. A title that is given for being a "community evangelist" of Microsoft. You wouldn't get that throwing it on a Hetzner machine.

Edit: Consider this article, and Geoff's statement about Azure credits.

https://www.theregister.com/2021/04/21/microsoft_revokes_mvp...

fs111 4 years ago | | |

Sounds like a pretty expensive privilege.

How is using cloudflare okay in this then? Cloudflare is also not Azure

pdimitar 4 years ago | | |

Grooming influential people to promote your corp and then bullying them when they didn't turn out to be just parroting your marketing slogans. Classic corporations.

kingcharles 4 years ago | | |

Huh. As an MVP myself (of DRM lol) I have to agree that was a poor astroturfing idea of Microsoft's. Although one employee != Microsoft. In all my MVP years Microsoft has never asked me to do anything like that. They've sent me to cool parties and events, but never asked for me to do anything as a result.

nunez 4 years ago | | |

Lol I KNEW IT! An independent consultant blogging about awesome things in Azure? #doubt

Seriously, yeah, if he's an MVP, he'll be fine.

southerntofu 4 years ago | |

Just did the calc and 600GB/day is about 55Mbit/s. That's really not a lot and if there's not too much computation server-side you could serve this from a raspberry pi at home (provided you have good uplink). But that's assuming you keep the CloudFlare cache of course, or as author mentioned himself, advertising only torrents for the multi-gig files.
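The calc above can be reproduced in a few lines. This is only the unit conversion, assuming decimal units (1 GB = 10^9 bytes) and perfectly even traffic over 24 hours, so it gives an average and says nothing about bursts. The helper names are made up for this sketch.

```python
SECONDS_PER_DAY = 86_400


def average_mbit_per_s(gb_per_day: float) -> float:
    """Average rate in Mbit/s needed to move `gb_per_day` decimal GB in one day."""
    return gb_per_day * 1e9 * 8 / SECONDS_PER_DAY / 1e6


def gb_per_day_at(mbit_per_s: float) -> float:
    """Decimal GB a link running flat out at `mbit_per_s` can move in one day."""
    return mbit_per_s * 1e6 / 8 * SECONDS_PER_DAY / 1e9


print(f"600 GB/day averages {average_mbit_per_s(600):.1f} Mbit/s")
print(f"a saturated 1 Gbit/s link moves {gb_per_day_at(1000):.0f} GB/day")
```

So 600 GB/day averages out to roughly 55.6 Mbit/s, and a gigabit link run flat out for a day moves about 10.8 TB.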

I really don't understand the cloud craze. Everything is more complex to debug, more expensive, and more shitty in all the possible ways you can imagine. I mean i was not exactly a fan of the VPS craze 10-15 years ago, but at least it wouldn't automatically ruin your bank account whenever you got a little traffic.

Kudos to the author for having so much money (thousands in one month?!) to waste. I wish i did too :)

brodouevencode 4 years ago | | |

> Everything is more complex to debug, more expensive, and more shitty in all the possible ways you can imagine.

Coming from traditional infrastructure and development methods, you're mostly right. Part of the expectation of the cloud is that you do things _their way_. And even then each cloud provider does things a little differently. However, if you're willing to subscribe to the <insert provider> way of doing things it (and you'll have to trust me here) makes many things easier. Here's a short list:

* networking setup is free/cheap/doesn't require a Cisco cert. you can trust a developer to set things up.

* object storage is so much easier than any file hosting scheme you can come up with

* the path from container-on-a-host to container-in-a-cluster to container-in-{serverless,k8s} is extremely straightforward

* I turn all my dev/test servers off at night and they don't cost me a thing

* consumption based compute will result in a much cheaper solution than a VPS or colo (admittedly there are many assumptions baked into this)

* some core services (like sqs, sns on Amazon) are extremely cheap and have provably reduced development time because you're not having to build these abstractions yourself.

This all being said I'm not advocating an all-in approach without thinking it through, but to do so where it's easy and makes sense.

EDIT: clarity

Spooky23 4 years ago | | |

When you are growing, it’s a no brainer. When you are at steady state it depends.

As a case in point, I worked in standing up a critical system in a large enterprise a few years ago. We spent about $12M on compute, storage, networking, etc. At operational state, it was about 40% cheaper than AWS. The problem is, it all sat there for 6-18 months filling up before we fully hit that state.

With a cloud provider, you pay a high unit cost but if you engineer intelligently your costs should move with utilization. Except for government, most entities generally want to see opex move with revenue and prefer to minimize capex where possible.

SkipperCat 4 years ago | | |

The cloud is great for scaling. The lead time for new servers deployed in a data center is weeks compared to seconds in the cloud. Plus there's no sunk cost in the cloud - you can turn it off when done and it evaporates.

Also, the cloud offers managed software as a service. You don't have to manage your own HA DB cluster or PubSub. It's all just there and it works. That can save you a lot on technical labor costs.

But yes, I do agree with your point. If you don't know what you're doing, you can nuke your budget super quick.

jollybean 4 years ago | | |

"I really don't understand the cloud craze"

The opposite, I don't understand why anyone would ever put up a server if they didn't have to.

It's not 'processing power' that's going to be the 'big cost' for most projects.

It's headcount and salary.

If you can materially improve the operating ability of your company, then a few $K in cloud fees is dirt cheap.

I used to work at a 'tech company' that made a physical product and our IT was abysmal. We had to wait weeks for our sysadmins to order blades, get things set up, there were outages etc..

If a project is definitely going to be 'a few linux servers and never more' - even then it would be cheaper and more reasonable to use virtual instances.

The time to 'roll your own' is when the infra. operating costs are a material part of your business.

For example, 'Dropbox' invariably had to roll their own infra, that was inevitable.

Similarly others.

That said - as this article indicates, it's easy to 'over do it' and end up in ridiculous amounts of complexity.

The Amazon IAM security model has always been bizarre and confusing, and the number of AWS services is mind-boggling.

But the core case of EC2+S3 +Networking, and then maybe a couple of other enhanced services for special case works fine.

I also object to what I think is a vast overuse of Cloudflare, I just don't believe that in most scenarios needing to have content at the edge really changes the experience that much.

sockpuppet69 4 years ago | | |

> 600GB/day is about 55Mbit/s

In what universe? This frictionless perfect vacuum where traffic comes in a wholly predictable consistent continuum?

oblio 4 years ago | | |

You're not the target audience.

Startups growing fast are the secondary audience.

The primary audience is large enterprises where their internal IT costs <<more>> than the cloud costs. Plus internal IT provides those resources after 6 months...

Retric 4 years ago | | |

Most people that use cloud computing aren’t stuck with the bills; the companies they work for are.

As to difficulty, they “solve” organizational problems by avoiding sticker shock. When someone wants 100+k in equipment, that’s often a huge number of hoops to jump through and possibly months of delays; with a giant bill every month, nobody complains, just like with the electric bill etc.

rr808 4 years ago | | |

> 600GB/day is about 55Mbit/s.

Not really, it was minimal traffic then sudden bursts of gigabytes. Of course throttling the big spikes would actually have been a good idea in hindsight to give an early warning.

TheIronMark 4 years ago | | |

> but at least it wouldn't automatically ruin your bank account whenever you got a little traffic.

This only happens when consumers fail to set budget alerts. Troy could have saved himself $10k with 15min worth of work.

hnbad 4 years ago | |

I think it is an irresponsible fad that people use cloud services for hobby projects (and despite its wide popularity I'm calling HIBP a hobby project since he's running it on the side for free) unless they have solid cloud ops experience from their day job.

Cloud providers love it when people do this and are famously easy to talk to when you get an unexpected invoice high enough to require remortgaging your house to even begin addressing it, but I think unless you're working on a side hustle that inherently will need to run in the cloud regardless of scale or are experimenting with cloud technologies in an explicitly time boxed toy project, using cloud services is the financial equivalent of handing a hobbyist craftsperson one of these chainsaw angle grinder attachments that even professionals find hard to keep from bouncing into your body.

If you do want to use cloud services for anything you pay out of your own pocket, the first consideration should be cost management and monitoring. Your employer might have big enough pockets to shrug off a runaway compute instance you forgot about for a month, but that can quickly translate into money that can be anything from inconvenient to life altering if it comes out of your personal budget.

Or just stick with the free tier and make sure everything simply shuts down if you run out. Sure, a "bandwidth exceeded" error page might not get you as many upvotes on HN, Reddit or social media, but it also won't impair your finances.

pcthrowaway 4 years ago | | |

I don't know what the alternative is. Run a home server and pay an ISP $$$ for unusually high upload bandwidth/throughput? 99/100 times running it in the cloud is going to be cheaper, easier, and more resilient.

Of course, the delayed sticker shock is a problem.. I think Google cloud actually lets you create a budget that turns services off if they go over, so there's a solution here if you run a hobby project that you suspect might take off and cost you more than it's worth.

papito 4 years ago | | |

My cloud costs for my micro instance are about $12 a month. Multiple domains on there. I don't use RDS, ElastiCache, not even load balancers. If you want to keep the costs reasonable, you must roll that stuff on your own, which is totally possible (and free), and in fact kind of fun as a learning experience.

bennyp101 4 years ago | |

Because it's not cool, and won't make your CV sparkle.

I'm sure there comes a point where cost of (hardware + maintenance + staffing) > (cloud + staffing), in which case sure crack on. But like you, I'll stick to a rented server for my stuff.

omegalulw 4 years ago | | |

The direction is opposite IMO. As you grow bigger on prem starts making a lot more sense.

pid-1 4 years ago | |

I have a few dozen personal projects on AWS using APIGW, Lambda, CloudFront, DynamoDB and S3.

Their monthly cost is something between 0 and a few cents.

Stuff like Hetzner is fine, but if you know your way around AWS you can realize massive cost savings. Prob the same for Azure.

Finally, in many places 40 EUR for a pet project is actually a lot of money.

welterde 4 years ago | | |

Probably would run just fine on a <= 4 euro/month virtual machine too. Of course it doesn't quite scale to zero like APIGW,lambda,etc. but on the other hand you can be fairly confident to not pay more if your pet project suddenly lands on the front page of HN.

llampx 4 years ago | | |

> Finally, in many places 40 EUR for a pet project is actually a lot of money.

Doesn't change the equation, unless you set up all your PAYG cloud infrastructure and never use it.

fuzzy2 4 years ago | |

That dedicated server you have to manage (ensure security, install the software you need, keep it updated and secure etc). It’s not for everyone.

Also, as you can see in a screenshot on TFA: Some services are simply dirt cheap. The storage account and its various “sub-services” is such a thing. It’s hard to compete with dedicated hardware here.

Depending on your dedicated hosting provider, the traffic cost trap exists, too. Hetzner is a bit of a special case.

ghughes 4 years ago | | |

> ensure security, install the software you need, keep it updated and secure etc

These things are now trivial enough that it doesn't make sense to pay 10x the cost of bare metal for a cloud provider to solve them for you unless you have a crazy amount of runway or absolutely no idea what you're doing.

sildur 4 years ago | | |

> That dedicated server you have to manage (ensure security, install the software you need, keep it updated and secure etc). It’s not for everyone.

apt install unattended-upgrades. And Hetzner's firewall.

dx034 4 years ago | | |

Most cloud users will have a VM somewhere which you also have to manage.

creshal 4 years ago | | |

> That dedicated server you have to manage (ensure security, install the software you need, keep it updated and secure etc). It’s not for everyone.

Hetzner also offers managed servers where all this is taken care of, for relatively fair prices.

FpUser 4 years ago | | |

>"That dedicated server you have to manage (ensure security, install the software you need, keep it updated and secure etc). It’s not for everyone."

Typical FUD. On modern servers, with this type of software, it takes very little time. You'd spend more managing your cloud architecture.

BlueTemplar 4 years ago | | |

Arguably Hetzner is a cloud operator too. I guess it's a spectrum...

bluedino 4 years ago | |

I wonder if the disk on a $40 Hetzner server would be fast/big enough for him. All the searching and storing of massive password hash collections.

He has a writeup here on how he gets costs down in a big way: https://www.troyhunt.com/serverless-to-the-max-doing-big-thi...

pdimitar 4 years ago | | |

I tried to scan through the linked article (and OP) but couldn't quite figure out Troy's storage requirements. Are they really massive?

The sum of the GB figures shown in the OP doesn't even amount to 200GB AFAICT. But even if it's something like 10TB that's still not super expensive on many hosting providers.

tetha 4 years ago | |

It depends somewhat on the organizational skillset you have, in my opinion.

Current workplace is considering a fully self-hosted stack as a unique selling point for the customers and segments we're in. That means we have storage and Linux admins available, as well as the tooling and know-how to run this securely and efficiently. Thus, placing large and often-downloaded files on our file stores at Hetzner is very much a no-brainer, because it adds very little workload to the teams maintaining these stores and it's cheap.

However, this can be a daunting thing if you don't have this skillset in the org. It can be learned, but that's time spent not working on the product (and it's not trivial to learn good administrative practices from the hell that Google results can be). At such a point, a cloud service just costs you fewer man-hours. And again, it wouldn't be much time for me, but it would be a lot of time if you had to figure all of that out on the fly. That's essentially why the saying goes that cloud services save you time, but cost money.

selestify 4 years ago | | |

Where is a good place to learn good administrative practices?

zarzavat 4 years ago | |

> I just don't understand why people use cloud services.

1. When they need to adjust rapidly between different resource usage profiles, e.g. because they are growing rapidly and can't predict what the usage will be X days in advance

2. They have huge resource requirements and don't care to invest in their own infrastructure, but can negotiate lower rates with a cloud provider

3. When their resource usage is modest but profitability is high enough that cloud expenditure is a rounding error

tlamponi 4 years ago | | |

> 1. When they need to adjust rapidly between different resource usage profiles, e.g. because they are growing rapidly and can't predict what the usage will be X days in advance

One can add new servers in minutes; removing them has a bit more latency, but I'd figure that with the huge price difference between rented and cloud you'll come out on top with the former in most cases. Also, just use a clustering or orchestration layer in between; they range from very simple to set up and use (e.g., Proxmox VE) to quite complex but also very capable (OpenShift, Kubernetes, ...).

> 2. They have huge resource requirements and don't care to invest in their own infrastructure, but can negotiate lower rates with a cloud provider

Using Hetzner or other providers is not investing in their own infra; that's using (= renting) the provider's infra and abilities (peering, fast uplinks, datacenter perks like utility redundancy and staff on site). The second sentence may be true, but probably not for most use cases that aren't huge yet, like the post here.

> 3. When their resource usage is modest but profitability is high enough that cloud expenditure is a rounding error

IFF, yes, and often infra costs are relatively low compared to salary costs, so that's definitely an optimization problem one should work through when deciding such things. Chances are that for most projects the profitability is good but not magic money printing, and infra costs are a non-negligible part that eats into revenue; then it's definitely worthwhile to think about avoiding the high premium most of those cloud offerings ask for.

dmurray 4 years ago | | |

4. When their resource usage used to be modest, so they got on cloud services for increased developer convenience, and now can't afford the switching costs even though their bills are expensive.

andi999 4 years ago | |

Maybe one wants to maintain the application and not the server? A long time ago I booked a VPS, installed some BSD on it, and thought I was good.

A month later an NTP security vulnerability was discovered, soon the server was taken offline, and some not-so-nice 'patch your things ASAP' emails came in. Since then, my take is that one should spend some time, probably daily, on one's own server if one wants to maintain it.

pmlnr 4 years ago | | |

Right, because a barebone docker hypervisor needs so much admining.

sdze 4 years ago | | |

Aren't Azure Compute Nodes also "bare metal"?

hardwaresofton 4 years ago | |

Well there’s a gap between the amount of convenience you get on the major clouds and one like Hetzner.

I’m a huge Hetzner fan, and their cloud offering is definitely growing but still isn’t as convenient and featureful as it could be (and they don’t share their roadmap currently so hard to tell what they’re working on next).

I’m trying to do something about it though, working on Nimbus Web Services[0]. In my mind all we need is something to bridge the managed services gap and make it very easy to set up the basic 3 tier app with some amount of scale/performance elasticity!

[0]: https://nimbusws.com

dx034 4 years ago | | |

But he could've put static files on a Hetzner server and still have his backend in Azure. That would've solved these issues and probably saved even more money.

fbrncci 4 years ago | |

I have a pretty complicated architecture that would cost me about $20-35 if it was hosted just on DigitalOcean or Hetzner. Instead it's on AWS (soon to be multicloud) and costs me about $140/mo (which does vary). But it does allow me to experiment, write long articles and design some fun stuff, which I blog about on my own website. The blog has gotten me both clients on freelance projects and enough "cred" to start on new projects I don't have any resume experience in. That's the only reason that I personally use cloud services (of course, the reasons for SaaS/Enterprise clients are usually more valid than mine).

rhn_mk1 4 years ago | | |

What stops you from having a blog on Hetzner? That doesn't seem like it has anything to do with AWS whatsoever... or do they offer a blogging platform?

InsomniacL 4 years ago | |

- Patching - Remediation, Monitoring, day0 response

- Security Information and Event Management - exports, alerts, OS configuration

- OS/Application Hardening - Encryption, Password/keys rotation, CIS/other baselines, Drift Management

- Backup - Encryption, (don't forget your passwords/keys are changing), retention, data protection compliance, monitoring, alerting, test days

- High Availability - replication, synchronisation, monitoring, alerts, test days

This is just the tip of the iceberg. If you operate in an environment where insurance, reputation, regulatory compliance, certification, etc. are important, then it's easy to see why PaaS solutions are desirable.

chillfox 4 years ago | | |

Eh, if my bank goes down or gets compromised then I will hold it against them regardless of if they are self hosting or using the cloud.

rcarmo 4 years ago | |

Because they provide managed services that VPS hosters don't have or which would require the overhead of maintaining and patching servers, and many people just want to get on with their lives instead of worrying about OS exploits...

martin_a 4 years ago | | |

That's why you take some kind of "managed hosting" where all of this is taken care of.

alpaca128 4 years ago | | |

But they do offer managed servers.

lazyant 4 years ago | |

If you only need a server, as in CPU, RAM, disk and bandwidth, with a more or less constant demand, then sure, a dedicated server is way cheaper than any cloud. You want to use cloud for the ecosystem of other services besides VM/instances, and especially to use them in an automated way. The other use case is elastic demand.

lvass 4 years ago | |

IIRC, Hetzner's "unlimited" traffic isn't quite unlimited. You get a few TB per month depending on what you contracted; if you go over it there are massive speed reductions until you pay a fee.