Scaling LLMs with Golang: How we serve millions of LLM requests | Dark Hacker News