“You don't need this overengineered goo for your project.”

“You don't need this overengineered goo for your project.”(twitter.com)

134 points by TobiasA 4 years ago | 101 comments

nisa 4 years ago |

> “You don't need this overengineered goo for your project.”

k8s is probably a great excuse to think how to compose your infrastructure and software in a declarative way - I'm still fascinated by https://demo.kubevious.io/ - It just made "click" when playing with that demo - it's not goo it's a different operating system and a different mindset.

You can do 80% of that with docker-compose / swarm for small projects but:

If you read HN you are in a huge bubble - gruelsome patched tomcat7 apps on Java8 with 20 properties/ini/xml config files are still popular - hosting things in docker or doing ci/cd is still not mainstream. At least in Europe in the public sector stuff where I was involved.

Sure you can mock it - but the declarative approach is powerful - if you can pull it off to have it across all your infrastructure and code with ci/cd and tests you are fast.

This alone correctly implemented https://github.com/adobe/rules_gitops solves so many problems I can't count the useless meetings we had over any of these bullet points, bazel alone would have solved most major pain points in that project. Just by beeing explizit and declarative.

Don't believe the hype but it's a powerful weapon.

corobo 4 years ago |

I love these Twitter takes. They say something to get attention but if you look at it for more than a second it's.. it's just nothing data

Comparing a troubleshooting guide to running a site on a couple of servers is a bit too different for me. Compare it to a troubleshooting guide for those two servers, let's see how they stack up. No using any "ask {specific person}" either

Don't get me wrong, kubernetes is overkill for most side project level things. I don't disagree, I just like to see things knocked down a peg fairly!

Also as mentioned by this tweeter they use more than 2 servers anyway

https://twitter.com/shadowmanos/status/1434980544740306947

They could probably save on resources and maintenance effort if they switched to containers assuming this is still the same or more

> This is #1 in a very long series of posts on Stack Overflow’s architecture. Welcome.

codeulike 4 years ago | |

Here's the stats on Stackoverflow

https://stackexchange.com/performance

1.3 billion page views per month, 9 web servers, 4 sql servers

Stackoverflow is notable because they went down the C#/MVC/SQL Server route from the start, which meant much better performance per server. Thats why they make an interesting counterexample to the usual way...

actually_a_dog 4 years ago | | |

You left off 10 servers: 2 Redis servers, 3 tag engine servers, 3 Elasticsearch servers, and 2 HAProxy servers. So, that's 23 servers in total, which is not a trivial amount, but also not a huge number, either.

sofixa 4 years ago | | |

> Stackoverflow is notable because they went down the C#/MVC/SQL Server route from the start, which meant much better performance per server

And also notable because everything is under an expensive license, so big performant servers is the cheaper option.

Edit: everything = Windows servers for their .NET app ( apparently in the process of migrating to .NET Core) and SQL Server

hnbad 4 years ago | |

> No using any "ask {specific person}" either

It's worth mentioning that the diagram is explicitly incomplete. The yellow endpoints are fixes but except for "END" the other endpoints are all either "unknown state" (i.e. "I have no idea what's broken") or problems that aren't addressed in further detail like "The issue could be with Kube Proxy" or even "Consult StackOverflow".

I'm not sure what a complete diagram would even look like but I don't think there's any way to infer complexity by looking at them in comparison.

tyingq 4 years ago | |

People do tend to cherry pick, don't they? Most of Stack Overflow's workload is returning a blob of html for a given url, to a not-logged-in user. Where that html doesn't even have to be the most recently saved copy.

Cthulhu_ 4 years ago | |

It's taking one source - look at how these people solved it! - and trying to apply it to others.

SO is relatively simple; it's basically customized forum software which is a solved problem that has been around for decades. A junior dev can build an alternative, and it can be built using tried and true solutions like MySQL + PHP, which are horizontally scalable with database sharding, read replicas, and maybe stuff like memcached to accumulate votes before updating the database or a CDN for caching static files.

Google has different problems and different workloads, and they have hundreds of times more applications with thousands of times more load. Apples and oranges.

JimDabell 4 years ago | | |

> it can be built using tried and true solutions like MySQL + PHP, which are horizontally scalable with database sharding, read replicas, and maybe stuff like memcached to accumulate votes before updating the database or a CDN for caching static files.

> Google has different problems and different workloads

Which of these do you think most organisations most closely resemble?

I don’t think anybody would disagree if you said that you should use Kubernetes for organisations that resemble Google. But most organisations don’t look anything like Google. They look a lot more like Stack Overflow. So the “You don’t need this…” statement holds true for almost everyone.

tester34 4 years ago | | |

>A junior dev can build an alternative

Of course junior dev can do it, the same way junior dev can make Youtube

it'll work as long as there's less than 100 concurrent users on SO and less than 50 4K 20min videos on youtube

krageon 4 years ago | | |

> a solved problem

Except 99% of forums run terrible software that doesn't perform, is not easily usable and won't work right on phones. That tells me it's not a solved problem at all.

bsaul 4 years ago |

recently facing the dilemma of choosing between k8s vs something more basic.

Features that seemed to be advocating for k8s were not server provisionning, but instead :

log management, easy setup of blue/green & canary deployment, not having to restart a vm upon new code deployment, etc...

How would you do those things as easily with other techs ?

monus 4 years ago |

‪That’s not an architecture diagram though, so it doesn’t represent the complexity at all.

I’m sure a troubleshooting map for bare linux server wouldn’t be less complicated than that.‬

littlestymaar 4 years ago | |

> I’m sure a troubleshooting map for bare linux server wouldn’t be less complicated than that.‬

Except your k8s runs on a Linux server, so this is just an addition. (Unless you're using a fully managed k8s cloud offering, but then you have an even bigger toubleshooting flowchart to navigate the provider's management interface: at least that's my experience with GKE, maybe Amazon and others are better)

corobo 4 years ago | | |

> Except your k8s runs on a Linux server,

Wouldn't it be more likely in this case that the server is built from configs? Ansible or whatever

The troubleshooting for the Linux server side is "spin up a new one and delete the old one"

necrobrit 4 years ago | |

100% and one of the great things about k8s is that this diagram applies to essentially any application. Standardisation is awesome.

capableweb 4 years ago | | |

Proper standardization is awesome. De facto, corp-owned standarization not so much.

hughrr 4 years ago | | |

Unfortunately as a k8s user in the real world every container is slightly different and has numerous hacks in it to make it compatible with k8s in some way or another. So no.

Aeolun 4 years ago | |

To be fair, the steps seem to map pretty well to the number of kubernetes resources you would need to create to do basic things like add a persistent disk, or get traffic to your application.

ajb 4 years ago | |

When I first saw it K8S reminded me a lot of systemd. I wouldn't be surprised if over the next few years each grow the features of the other.

Max_aaa 4 years ago | | |

Meet podman:

https://developers.redhat.com/blog/2020/11/19/transitioning-...

zorr 4 years ago | | |

That sounds logical as they kind of perform the same tasks with the difference being that systemd manages workloads on a single system and k8s manages workloads on a cluster.

p_l 4 years ago |

Interestingly enough, SO is apparently going with k8s a lot...

https://stackoverflow.blog/2021/07/21/why-you-should-build-o...

Jabbles 4 years ago |

> StackOverflow runs on a couple of servers.

Does it?

DaGardner 4 years ago | |

Yes it does: https://stackexchange.com/performance

Pretty impressive I think.

chrismorgan 4 years ago | | |

No it doesn’t. From your link:

• 9 web servers

• 4 SQL servers

• 2 Redis servers

• 3 tag engine servers

• 3 Elasticsearch servers

• 2 HAProxy servers

That comes to 23. I know “a couple” is sometimes used to mean more than two, but… not that much more than two.

“A couple” is just flat-out wrong; I’d guess that he’s misinterpreting ancient figures, taking the figures from no later than about 2013 about how many web servers (ignoring other types, which are presently more than half) they needed to cope with the load (ignoring the lots more servers that they have for headroom, redundancy and future-readiness).

szszrk 4 years ago | | |

It is impressive, but it's not a raspberry pi kind of setup. Just two of those "couple" are hot and standby DB servers with 1.5TB RAM. That infrastructure is scaled A LOT vertically.

jusonchan81 4 years ago |

This seems like something you can implement in a workflow tool like Netflix Conductor and we can automate the debugging process with visuals.

kubanczyk 4 years ago |

Just the pic, archived: https://web.archive.org/web/20210907091921/https://pbs.twimg...

rapphil 4 years ago |

Nomad ftw

wayneftw 4 years ago |

This from a guy who sells over engineered ORM goo (LLBLGen).

StackOverflow didn't use that either and instead chose to invent their own query builder/mapper known as Dapper.

preommr 4 years ago |

> StackOverflow runs on a couple of servers.

K8s can as well.

The difference is a bunch of servers running k8s or a bunch of servers running custom code to duplicate parts of k8s.

mdoms 4 years ago | |

Or a couple of servers running IIS with a handful of web apps, maybe a reverse proxy.

VBprogrammer 4 years ago | | |

And FTP for putting your PHP scripts into production.

ed_elliott_asc 4 years ago |

I hate when stack overflow is held up as an example of how we can run any system on “a few servers” - stacknoverflow has like 3 features and has an engineering focus on the single goal of performance and keeping on running on the small subset of servers.

Every other project as different constraints.

luaybs 4 years ago |

This argument is nonsense

InternetPerson 4 years ago |

Attention! Someone who knows nothing about your project is offering free advice!!