Measuring AI Ability to Complete Long Software Tasks | Dark Hacker News