The Benchmark Saturation Problem: Why AI Evaluation Needs Systems Thinking(distributedthoughts.org)2 points by TheIronYuppie 234 days ago | 0 commentsNo comments yet