DiceBench: A Simple Task Humans Fundamentally Cannot Do (But AI Might)(dice-bench.vercel.app) |
DiceBench: A Simple Task Humans Fundamentally Cannot Do (But AI Might)(dice-bench.vercel.app) |
But maybe we need simpler examples that demonstrate fundamentally different ways of processing information. The dice prediction isn't important - what matters is finding clean examples where all information is visible, but humans are cognitively limited in processing it, regardless of time or expertise.
It's about moving beyond human performance as our primary reference point for measuring AI capabilities.