Evaluating LLMs Is a Minefield | Dark Hacker News