Open Source Models Score Low on ARC-AGI-2 Reasoning Benchmark | Dark Hacker News