JanitorBench: A new LLM benchmark for multi-turn chats | Dark Hacker News