A system programmer's guide to LLM inference | Dark Hacker News