Reward hacking is swamping model intelligence gains(cursor.com)3 points by DR_MING 6 days ago | 0 commentsNo comments yet