Measuring AI Ability to Complete Long Tasks – METR(metr.org)2 points by diginova 283 days ago | 0 commentsNo comments yet