Bluffbench is near saturation: LLMs can interpret counterintuitive plots | Dark Hacker News