We reduced our server costs by moving away from AWS

We reduced our server costs by moving away from AWS(levelup.gitconnected.com)

440 points by caberus 3 years ago | 344 comments

mabbo 3 years ago |

I'll always celebrate stories like this, but I also don't take some kind of anti-AWS lesson from it.

This company saved $800k/year. Perfect time to go in-house with this solution.

But when they were 1/10th this size, they'd only have saved $80k/year. Does that cover the cost of the engineering to build and maintain this system? Maybe not. And when they were 1/100th the size, it would have been laughable to go in-house.

At the right time, you make the right transitions.

MajimasEyepatch 3 years ago | |

Thank you for bringing up the engineering cost. People always look at this as just AWS > Bare metal or whatever, but there's so much more to it than that.

If they saved $800k per year, and they have to hire four additional ops engineers to run it at a cost of $400k per year, then they actually saved $400k. Which is still substantial and, all else being equal, sounds worthwhile.

If they saved $800k per year, and they have to hire ten additional ops engineers to run it at a cost of $1 million per year, then they've actually gone and burned $200k on something that provides no additional value to the business or their customers.

jjav 3 years ago | | |

> have to hire

But [to state the obvious but sometimes overlooked] you don't just point an AWS account at the company git repo and walk away.

There's a lot of work and expertise needed to keep AWS setup up and running, so you already have to hire people.

At a modest size startup we already have close to ten people DevOps team to manage AWS. That same size team could easily keep bare metal servers running. At our scale it's still a bit cheaper to be on AWS, but not too far in the growth curve it'll start to become cheaper to be on bare metal.

Dma54rhs 3 years ago | | |

AWS knowledge and engineering doesn't come for free either. People have built whole careers and businesses around it.

rstuart4133 3 years ago | | |

> If they saved $800k per year, and they have to hire four additional ops engineers to run it at a cost of $400k per year,

I guess you've never done it yourself?

I'm not sure what those 4 engineers would be doing. You purchase a few servers with 5 year on site warranty and remote management, you take a couple of days to install them in some racks, and you never visit the site for 5 years. If they break, the manufacturer sends someone to fix them on site. The rest of the time you administer them remotely - just like you would AWS. If you want VM's install proxmox.

Whether it's cheaper than renting bare metal in a data centre is an debatable - colo costs a small fortune where I live.

But they aren't comparing it to that. They are comparing it to locking themselves into a closed source system that need specialised expertise to run, versus administering an open source Linux system that even the people running EC2 should be very familiar with. Most of the stuff AWS provides has open source counterparts - hell a whole pile of it is just open source they wrap a proprietary API around and charge you for.

10 times sounds like a lot, I'm guessing it's really less. Even so I come form the camp that shakes his head in disbelief at what people will pay AWS for basic VM and storage services, dressed up in a fancy API and marketing.

mbesto 3 years ago | | |

Exactly. In the "old IT world" we call this TCO = Total Cost of Ownership.

tester756 3 years ago | | |

huge salaries + ten "ops engineers", lol.

I know *data centers* that run on a few naive 20 yos admins/technicans + 1-2 "engineers" and all of them combined do receive salary of $5-10k/month in east eu

jiveturkey 3 years ago | | |

You're not including reliability and availability projections, and the intangible cost of transferable skills wrt infrastructure. (ability to hire sufficiently skilled people to run it)

otabdeveloper4 3 years ago | | |

Hiring a person to do 'docker compose up' for you is orders of magnitude cheaper than whatever AWS-specific knowldege is needed to not have AWS crap its bed.

hintymad 3 years ago | |

People don't consider productivity? Maybe things have gotten a lot better in the industry now. Otherwise, to rehash an older comment on HN:

I'd like to remind everyone about Uber's experience: no EC2-like functionality until at least 2018, probably even now. Teams would negotiate with CTO for more machines. Uber's container-based solution didn't support persistent volumes for years. Uber's distributed database was based on friendfeed's design and was notoriously harder to use than DynamoDB or Cassandra. Uber's engineers couldn't provision Cassandra instances via API. They had to fill in a 10-pager to justify their use cases. Uber's on-rack router broke back in 2017 and the networking team didn't know about it because their dashboard was not properly set up and what the funk is eBPF? Uber tried but failed to build anything even closer to S3. Uber's HDFS cluster was grossly inefficient and expensive. That is, Uber's productivity sucked because they didn't have the out-of-box flexibility offered by cloud.

Melatonic 3 years ago | | |

That also just sounds like Uber had hired crap talent....

humanwhosits 3 years ago | | |

and they had trouble moving workloads to the cloud because bringing up new capacity was a giant set of circular microservice dependencies

systemvoltage 3 years ago | |

The initial bit is important though. It creates a circular dependency. If you start out without AWS, your entire company's software from how you build monoliths/microservices/queues changes.

Look at Stack Overflow's architecture which stands apart because it was never designed to work in cloud from the beginning: https://stackexchange.com/performance

I'd argue that 90% of the SaaS doesn't have SO's scale. The whole thing would work just fine on a couple of FreeBSD servers running postgres and un-dockerized monolith. Half a rack at most with redundancy and replication.

But, if you've built your whole company around proprietary lamda functions and a vast range of AWS offerings, you're setting up yourself to never get out of the mess.

mr_toad 3 years ago | | |

> The whole thing would work just fine on a couple of FreeBSD servers

And you need dev servers. And a way to keep them in sync with production because you can’t just create a branch and provision a bare metal server on the fly. And you need a deployment process to get code from dev through test, UAT and production, and this process is lengthy and fraught because you can’t just fire up a new instance and switch the elastic IP, you actually have to deploy code on the physical machine and make sure there aren’t any configuration differences between your test and prod environments that’ll cause ‘it worked before’ problems. Developer productivity tanks, people are afraid to make changes, any pretence of agile/devops goes out the window and eventually everyone gets sick of the crusty old server and decides a complete rewrite is needed.

tekknik 3 years ago | | |

> I'd argue that 90% of the SaaS doesn't have SO's scale.

While Stack Overflow is popular in the dev world, not so much outside of it. The show they’re only serving 300 req/s. 19 servers to power the entire stack as well. I wouldn’t call this efficient.

jacobsenscott 3 years ago | | |

Yes - but the FOMO industry is too strong.

lbriner 3 years ago | |

Yes, exactly this.

At what point do you have the time/money/confidence to invest goodness knows how much in a data centre with space to grow, to purchase an enormous amount of capital to have it all installed etc. the building alone could eat that first years saving easily.

How many people are now needed to fault-find bad hardware/software/networks, to be on call for any problems? How many calls out to the Electrician to fix some power issue?

How much to setup and run a large air-con system for the data centre. Maybe not much in the US where aircon is common but much more expensive in Europe.

The fact they could afford to do this over such a short time period speaks to having a decent amount of cash on-hand.

hamandcheese 3 years ago | | |

> At what point do you have the time/money/confidence to invest goodness knows how much in a data centre with space to grow, to purchase an enormous amount of capital to have it all installed etc. the building alone could eat that first years saving easily.

Co-locating has no capital investment other than hardware, and is pretty cheap.

A 40U rack of compute charged as equivalent ec2 instances has a retail price easily of hundreds of thousands, if not a million+ USD per year.

Suppose each U has a $10k capital cost to make the numbers round, that is $400k in capital.

All this to say is that I don’t think capital is as big a factor as you might think.

jjav 3 years ago | | |

> fix some power issue

> large air-con system

You wouldn't usually jump from AWS to buying up real estate to build your own physical data center.

A sensible first step is to rent a rack at a colocation facility. They handle power, cooling, redundancy, physical access for you.

i_have_an_idea 3 years ago | |

Anecdotal, but for one of my projects, Google Cloud / Compute Engine VMs cost around ~$5k a month all in. The exact same setup, when we moved it to LiquidWeb, cost us $2k.

Don't underestimate the savings that can be made from switching from a big-name cloud provider to a more old school hosting provider.

outworlder 3 years ago | |

Thank you!

At work, people keep complaining about our costs and coming up with spreadsheets showing how much money we would save with our own hardware.

They never add the engineering costs. When they do, they forget to include the ongoing maintenance. Or the new SMEs that need to be hired (and on call). Or even the opportunity cost of doing a multi-year migration to arrive at the exact spot they already are today.

All that money, and noone is looking into optimizing our systems to shrink the bill...

xani_ 3 years ago | |

80k ? Honestly probably does. Our ops team of 3 spends maybe 10% of the time on the managing of few hardware racks we have in our local colocation. There are even months where nothing at hardware/hypervisor level is touched

oxfordmale 3 years ago | | |

3 people at 0.10% of their time is already 24K, assuming an 80K salary for each of you. You don't mention patching systems, or the time spend replacing the hardware racks every x years. It is very easy to underestimate the cost of maintenance.

hisorange 3 years ago | |

Hey there, (Zsolt Varga here)

Yep, we should have pointed out that this has to be done in at the right time. Generally this article was about how we saved $1M expense and we was able to share this saving with our customers. Not an attack on AWS, I still use their platform for my other projects.

But to be accurate, this migration happened last year, and now we already doubled our usage and this brings us to $2M / year saving now.

I 100% agree with the comments below, use AWS while you are defining your product and growing, Todd (the founder) did this right. Spending days and nights on managing your own infrastructure or paying a devops employee to do it, in the early phase is just pure waste, and takes away focus. But we moved away from AWS at the right time.

Also, yes we had to do this to be competitive, as prerendering is a resource heavy service, and if your site is like ProductBoard (awesome tool) which will not consume terabits of traffic or use petabytes of ram, then you can stay on AWS forever and enjoy the benefit of not caring.

And what we lost? Minor inconvenience at this point, I cannot just pull up a new database in 10 minutes, but we don't really do that anymore, most of our current projects requires months of research, planning, and delivery. With those time frames we can notify our devops and get everything in place way before we need it.

So, keep on using AWS (it rocks!), just be sure to pay attention to the bill, and don't underestimate the cost of the traffic.

Have a nice one

chii 3 years ago | |

> This company saved $800k/year. Perfect time to go in-house with this solution.

$800k/yr is like the cost of 2-3 engineers. Even if they're capable of doing all of the work that used to have been done by aws, you don't have any room to expand without having to put up high capital costs, and certainly not on a dime.

It sounds amazing now, but wait till the future comes and you can no longer run your own data center as cheaply as amazon.

senectus1 3 years ago | |

this point is even more poignant when taking into account scalability and elasticity. Our usage of Azure/O365/AWS saved us a lot of time staff and money during our make it or break it growth period. now we're a lot bigger and more stable we're having to reconsider allowing MS/Amazon bending us over the barrel quite so much.

vlunkr 3 years ago | |

Also your company needs to be mature enough to know really well what your hardware requirements are. For a growing company, it's really great to switch your RDS instance to a bigger type because your database load has tripled in a few months and you didn't know that was coming.

joshstrange 3 years ago |

They don't mention at all what services they were using (other than slight mention of S3) which makes it very hard to respond to this. If you are running everything on EC2 then you are going to have a bad time (especially if you aren't using reserved instances).

AWS (IMHO) shines with the various services they provide (S3, Lambda, CloudFront, API Gateway, SQS< SES, to name a few). AWS is a game of trying to reduce your bill and often that means using AWS-specific services. If you want to stay completely "cloud agnostic" you are going to paying more than buying into a "cloud", in that scenario then you absolutely should be looking at dedicated servers. AWS is great because you can bring existing software and just run it in EC2 (or their container stuff if your software is containerized) but the AWS magic comes from using their managed services (also spinning up/down EC2 instances as needed, but if you are running them 24/7 then consider alternatives or at least pay for reserved).

alberth 3 years ago |

Dedicated hosting providers.

I'm so amazed that somehow people completely forget that for literally decades, web host provided dedicated hosting options at fantastic prices.

Yes, loooong time ago - to get your dedicated server might have taken a few hours to provision and the instant server access that AWS brought should not be discredited.

But large numbers of web host today allow you to programmatically spin up a dedicated web host instantaneously and at a fraction of the cost.

floatinglotus 3 years ago |

This is the Trillion Dollar Paradox described by Martin Casado. You’re crazy if you don’t start your business in the cloud, you’re crazy if you stay there.

My new startup is focused on helping application owners repatriate their workloads into their own infrastructure.

Our goal is to solve the network complexity challenges with a fully open network stack (open source software with all of the hardware options you would expect, and some you wouldn’t). The solution is designed to be turnkey and require very little network experience. It will use standard DevOps tools that you’re already using.

We’re announcing it in two weeks and will be posting info here on HN!

P5fRxh5kUvp2th 3 years ago |

I'm glad to see more of these types of articles, but at the same time I'm a bit flabbergasted that this isn't obvious for so many people.

These cloud providers are, by definition, charging you more than it would cost you to run it yourself. What you get in return is a guarantee of expertise and an ecosystem.

0xbadcafebee 3 years ago |

They saved $800K on their AWS bill, but

    - may spend $250K on servers, replaced after 3 years becomes $83k/yr
    - may spend $120-250K on extra staff to maintain the infrastructure
    - may spend $15K for a cage in a DC

They still save $452K/yr overall (actual savings 1st year only $285K). It's still a savings for sure, but always keep TCO in mind.

The real fun comes later when you outgrow your cage and there's not enough space left in that DC, or they just have shitty service constantly knocking out your racks, and you have to consider splitting your infra between DCs (a huge rewrite) or moving DCs (a huge literal lift and shift). Have been part of both, it's... definitely a learning experience.

maerF0x0 3 years ago |

I've said this a hundred times and it seems not loud enough.

AWS is not cheap because of your server costs.

AWS is cheap because of elasticity, velocity (opportunity cost of next feature), and reduced maintenance hours.

"The cloud" was never (afaik) was about getting a cheaper VPS. It was about being able to get them on demand, give them back on demand, and generally not have to maintain anything besides your code (and maybe apply updates to datastores / AMIs)

Now, if those premises are not true for your startup/business, then AWS is not the tool for you. I didnt see any analysis of ongoing maintenance costs in the 800k saved, but will it take 1-2 FTE engineers to now be more oncall, more server upgrades, more security patches etc? That's easily 1/2 that savings gone already.

Edit: for the most part these attributes apply to GCP, Azure, Heroku etc as well, its not just about AWS

Gregioei 3 years ago |

So little real information...

So the team is now responsible for backups, hardware ordering,.forecast etc?

How big is the team now compared to before?

Does it scale?

If you price it correctly and keep the free tier small, I would either talked to AWS for better pricing or moved to another cloud Provider.

S3 on AWS is a total no-brainer, minio on bare metal might mean much more work and a bigger infra team than business actually wants.

I would also love to know what optimizations are already in place. Does cloudflare caching work? Are the results compressed on rest? Is geolocation latency relevant?

Why even Cassandra? Are websites not unique? Wouldn't a nginx and a few big servers not work?

But who knows? The article doesn't tell me that :-(

hazmazlaz 3 years ago |

There is no way this figure is accurate. The annual spend cited of $1,000,000 is purely hypothetical, as admitted here:

"However, all this data and processes need to happen on a server and, of course, we used AWS for it. A few years of growth later, we’re handling over 70,000 pages per minute, storing around 560 million pages, and paying well over $1,000,000 per year.

Or at least we would be paying that much if we stayed with AWS. Instead, we were able to cut costs by 80% in a little over three months with some out-of-the-box thinking and a clear plan."

pessimizer 3 years ago | |

> There is no way this figure is accurate.

You've got the FUD covered, but you also need to add at least some substance to your claim. How do you know this figure would not be accurate? Why is your (hypothetical, not offered) estimate better than the author's?

hazmazlaz 3 years ago | | |

Just as a basic sanity check, they claim to store 560 million pages of pure HTML. Internet Archive estimates the median HTML weight of a page to be ~20kb, which I fudged to 30kb in order to make a liberal estimate. Multiplied by the number of pages that would amount to approximately 16.8TB of data stored in s3. According to the AWS pricing calculator that would cost between $300-400 per month, and that's in s3 standard tier pricing with no intelligent tiering set up for less often retrieved data. I didn't bother calculating the compute cost for their rendering because I don't really understand how they do it, and also because the storage estimate was so far off of their claim that it didn't seem necessary to bother in order to refute the claim. I appreciate your challenge for additional evidence, and this is what I have to offer in response. I won't pretend like I did an exhaustive scientific study, this is more back of the napkin type stuff FWIW, but here it is.

merb 3 years ago | | |

well the problem with the article is basically that they left out a lot of important detail, like which bare metal servers, how many, where do they host now, did they use cloudfront what about cloudflare did they use an edge cache? what about reducing costs by killing stopping unneeded resources? a lot of their workload looks dynamic. it's also fishy what they wrote here:

> After testing whether Prerender pages could be cached in both S3 and minio, we slowly diverted traffic away from AWS S3 and towards minio.

if they served directly from s3 that would be, stupid?

> In the last four weeks, we moved most of the cache workload from AWS S3 to our own Cassandra cluster.

is also strange. it misses a lot of detail but it does not look like they just migrated away from s3...

(looks like their new hoster is hetzner, from service.prerender.io )

m0llusk 3 years ago |

There is not quite enough information here to be sure, but this article highlights transmission costs. This particular business model involves throwing around big chunks of data just in case they end up being needed and then handing them back out in response to potentially large numbers of requests. That would make this particular usage pattern fit to exactly what AWS is charging the most for. Also many alternative AWS services that can be used to speed up or simplify services are not really going to help with this case.

So an alternative way of interpreting this is more along the lines of: We may have saved up to 80% of server costs by moving from AWS, but you almost certainly won't save that much even if a bunch gets spent on developing operations and tools.

varsketiz 3 years ago | |

You can save even more if your app uses richer formats than images.

Also, if you are bigger and can start really negotiating with hardware providers.

maxfurman 3 years ago |

I feel like there's a middle step missing from this article (or I just missed it reading quickly) - did they build their own data center? Where are these new non-AWS servers physically located?

Joel_Mckay 3 years ago |

In general, last time I looked at AWS it made sense from 2TB to 30TB a month, and under 400k connections a day. If either range was exceeded, than the service ceased to be the economical choice when compared with CDN providers, and colo/self-managed unlimited-traffic options.

For example, if you primarily serve large media or many tiny files to clients that don't support http Multipart Types, than AWS can cost a lot more than the alternatives. However, AWS is generally an economical cloud provider, and a good option for those who outsourced most of their IT infrastructure.

The article would be better if it cited where the variable costs arose.

TheGuyWhoCodes 3 years ago |

"We used Apache Cassandra nodes on each of the servers that were compatible with AWS S3". What does this even mean?

Regardless, starting a new Cassandra cluster in late 2022?! I bet they can save even more by just going with scylladb

joshstrange 3 years ago | |

That was also confusing to me as Cassandra is a NoSQL DB last I checked. I found this [0] online that indicates with some extra software you can talk to it like S3 but yeah...

[0] https://dzone.com/articles/s3-compatible-storage-with-cassan...

epberry 3 years ago |

I believe this is the use case Cloudflare is really targeting with R2. They recently connected Cache Reserve to R2 to make this even easier. We wrote up a breakdown for S3 vs R2 and found that R2 would be significantly cheaper when the majority of traffic is cached data, https://www.vantage.sh/blog/cloudflare-r2-aws-s3-comparison

gibsonf1 3 years ago |

We've just finished moving servers from AWS to https://hetzner.com - and saved 10X with servers of double the capability. A great experience so far.

jacooper 3 years ago | |

How did you get them to approve a large amount of Cloud instances / dedicated servers?

I heard they are very stubborn to increase the per user limit of cloud instances.

Also how did you deal with S3? Did you switch to another provider ? Like B2?

CoolCold 3 years ago | | |

May be it was not large. Say for one project DB server on Hetzner with 160GB RAM, 32cores/64theads,2x3.84TB NVMe + 2x512GB SSD costs ~ 240$, hosting mid-sized ~ 1.6TB MySQL DB. On managed RDS it was around ~ $2200/month when I checked.

Even if you have 2 such boxes for master/replica, it's close to 5x savings.

gibsonf1 3 years ago | | |

We haven't run into the number of servers issue. For S3, we've switched to Wasabi which very nicely uses the identical API.

hnrodey 3 years ago |

I find this interesting, if nothing else. My first question is what was the opportunity cost of focusing manpower to setting up on-prem infrastructure that now needs maintained? What on the product roadmap was sacrificed/delayed in exchange for the time on this project? What are the projected future hiring costs to maintain these servers (and applications like Cassandra!) going forward? Nothing is free, and at just 4-5 additional hires they will be giving back a large chunk of that $800k to employees. IDK - maybe that's a fare trade-off to pump up the common man with money instead of the establishment.

brodouevencode 3 years ago | |

This is what a lot of people miss when they talk about moving to/from cloud providers. The marginal cost to add X more servers in the cloud is basically nothing, whereas to set up a new rack for on-prem requires requirements gathering, purchase orders, finance approvals, someone being at the dock when the UPS truck arrives, rack and stacks, etc. Those are one-time, yet very real costs. These fall under cap-ex which accounting-wise is treated very differently than op-ex. Now the cost in the cloud bakes all that in, and is distributed around with other users of the provider. Your accounting models are also easier ("pay for what you use").

Couple that with the very well known fact that AWS has outrageous data egress charges and there are patterns that can emerge where you're still in cloud but not racking up massive outbound data charges.

fabian2k 3 years ago | | |

You can rent servers if you don't want to bother with this. The choice is not only between doing everything yourself and the cloud, there are a lot of options in between.

wglass 3 years ago |

I found the headline to be misleading. The article is mostly about the migration process (which is interesting), but very little about the details of the cost savings.

What does it cost to run their data center? What are the salaries they are paying for internal IT efforts to administer it? Is it an apples-to-apples comparison, e.g. are they load balancing across multiple datacenters in case of an outage?

It sounds like this was a good move for Prerender but it's hard to generalize the cost claims to other situations without details.

theptip 3 years ago |

Perhaps I'm missing it in the OP -- I don't see any mention of what they actually moved to. CoLo? VPS? On-prem?

This seems like a key detail when telling people about your migration off AWS.

jacooper 3 years ago | |

On-prem

shrubble 3 years ago |

I expect to see a great deal more of the "cheap and cheerful" AWS migration stories in the future. With the tanking of the market and (apparent) limits to growth being in the forefront, reducing expenses will become more important.

Before, it was easy to justify almost any expense with the "we just need to get 1% of this $100 billion market" and now it is "hunker down and do everything you can to be ramen-profitable, in order to survive and thrive".

taldo 3 years ago |

When the cost of delivering your product/service is mostly compute or traffic, sure, migrating off of AWS is a must once you reach a certain scale. But for the other 99% [0], where infrastructure is but a small cost, then think really hard if you're willing to trade the engineering effort for the convenience of managed cloud services.

[0]: or 90%, or 80% or who cares, but a majority of software services seem to NOT be compute- or traffic- heavy.

registeredcorn 3 years ago |

(Note: I have never done any professional work in cloud. I could be completely mistaken. Feel free to correct me if I'm completely off-base.)

It's a fascinating article, for sure. I would have been interested to hear what their backup strategy looked like though.

One of the big benefits of cloud services, that I am aware of, is the assurance that if natural disaster strikes, you don't lose all of your data. I kind of got the impression that, more than anything else, that is what you are paying for. Data protection and uptime.

I suppose big enough bills could lead a company to make the kinds of changes that Prerender did, but when that disaster does strike, and it is time to try and recover from a fire, flood, earthquake, etc. the responsibility and speed of getting your customers back online is reliant completely upon your staff - a staff who might be extremely shaken up, hurt, or pre-occupied in taking care of their own affairs. I'm not saying it's not possible, but there is a kind of cost that comes in the form of responsibility. It's a trade off that I would not fault many people from avoiding.

thoop 3 years ago |

Hi! I’m Todd, the solopreneur founder of Prerender.io and I created that $1,000,000/year AWS bill. I sold Prerender.io to Saas.group in 2020 and the new team has done an incredible job growing and changing Prerender since I left.

$1M per year bill is a lot, but the Prerender back end is extremely write-heavy. It’s constantly loading URLs in Chrome in order to update the cached HTML so that the HTML is ready for sub-second serving to Google and other crawlers.

Being a solo founder with a profitable product that was growing organically every month, I really didn’t have the time to personally embark on a big server migration with a bunch of unknown risks (since I had never run any bare metal servers before). So the architecture was set early on and AWS allowed me the flexibility to continue to scale while I focused on the rest of business.

Just for a little more context on what was part of that $1M bill, I was running 1,000+ ec2 spot instances running Chrome browsers (phantomjs in the early days). I forget which instance type but I generally tried to scale horizontally with more smaller instance sizes for a few different reasons. Those servers, the rest of the infrastructure around rendering and saving all the HTML, and some data costs ended up being a little more than 50% the bill. Running websites through Chrome at scale is not cheap!

I had something like 20 Postgres databases on RDS used for different shards containing URL metadata, like last recache date. It was so write heavy that I had to really shard the databases. For a while I had one single shard and I eventually ran into the postgres transaction ID wraparound failure. That was not fun so I definitely over provisioned RDS shards in the future to prevent that from happening again. I think RDS costs were like 10%.

All of the HTML was stored in s3 and the number of GET requests wasn’t too crazy but being so write heavy on PUT requests for recaching HTML, with a decent sized chunk of data, the servers to serve customer requests, and data-our from our public endpoint, that was probably 30%.

There were a few other things like SQS for populating recache queues, elasticache, etc.

I never bought reserved instances and I figured the new team would go down that route but they blew me away with what they were able to do with bare metal servers. So kudos to the current Prerender team for doing such great work! Maybe that helps provide a little more context for the great comments I’m seeing here.

throwaway20221 3 years ago |

Throwaway here. I work for a startup which runs compute-bound scientific simulations. We are considering migrating from our single self-managed HPC, to AWS EC2. We anticipate using the largest standard instances, with highly spikey usage: frequently none, at times up to 10 concurrent instances, perhaps more later.

AWS seems ideal to me because it would let us easily scale up and down as our usage varies. But some of the anti-AWS sentiment in this article has given me pause. Any reason not to do this? Any alternatives I've missed?

Our storage and transfer usage will be negligible; it's all compute.

jaclaz 3 years ago |

As a side note, I find this:

>Do you have any advice for software engineers who are just starting out?

>Don’t be afraid to talk with the customers. Throughout my career, the best software engineers were the ones who worked with the customer to solve their problems. Sometimes you can sack a half year of development time just by learning that you can solve the customer’s issue with a single line of code. I think the best engineers are creating solutions for real world problems.

to be very good generic advice.

jmyeet 3 years ago |

This is unsurprising.

The point of AWS is to be flexible. You’re paying for that. It’s easy to start. It’s easy to stop. It’s easy to change capacity.

Running your own servers is none of these things. But it is cheaper at sufficient scale. You can’t ignore the labor cost (particularly engineering) however.

Where AWS shines is with highly volatile workloads. With your own servers you have to provision for peak capacity. That’s less the case with AWS.

No shade on the author of course. It’s great to read things like this.

jacooper 3 years ago |

Some S3 providers can give you free egress when using a CDN.

For example backblaze B2 offers free egress through Cloudflare, Fastly, BunnyCDN.

https://help.backblaze.com/hc/en-us/articles/217666928-Using...

otabdeveloper4 3 years ago |

No kidding? You reduced your server costs by moving away from the most expensive hoster on the planet? Good for you, I guess :)

xyzabc098123 3 years ago |

Want to add, cloud vendors' proposition is not about the capability; it's a basic requirement and no longer an USP for something to 'work with the cloud'. It's about security. It's about outsourcing the liability so that the manager has less responsibility.

alexchantavy 3 years ago |

I wish the article went into detail about what hardware they used for each server, what was their disaster mitigation plan, and other considerations that you don't need to worry about with paying for a cloud provider.

lakomen 3 years ago |

In other news, we got wet in the rain SCNR.

Are you really saying that AWS and other clouds are expensive? Say it ain't so :)

zc2 3 years ago |

Then you need to spend 2M to hire engineers helping you to maintain the in premise infra

henning 3 years ago |

OK, so they're now stuck maintaining their own Cassandra cluster. How much does that cost?

If it costs you $1,000,000 a year to serve 1166 requests a second, maybe you fucked up.

seattle_spring 3 years ago |

The company I used to work for. They successfully did cut server costs!

...at the expense of 40 eng-years (20 eng over 2 years) spent on the migration.

"My Google Cloud was suspended too" https://news.ycombinator.com/item?id=32571055 "Google suspended our domain out of the blue" https://news.ycombinator.com/item?id=32798368 "Tell HN: Google Cloud suspended our production projects at 1am on Saturday" https://news.ycombinator.com/item?id=32547912 "AWS account was permanently closed because it was suspended for 90 days" https://news.ycombinator.com/item?id=31571538 (probably happens with Azure and other smaller platforms as well, e.g. Hetzner, DigitalOcean, Vultr, Scaleway and so on)