Fastgen – SOTA LLM inference in 3k lines of Python | Dark Hacker News