Show HN: Realtime LLM Chat on an 8GB Nvidia GPU | Dark Hacker News