Scaling LLMs apps via accuracy, latency, cost | Dark Hacker News