We might be overestimating coding agent performance on SWE-Bench | Dark Hacker News