Measuring AI Ability to Complete Long Tasks – METR | Dark Hacker News