Spaghetti Bench: Evaluating AI Agents on Concurrency Bug Fixes(pastalab.org)2 points by aoli-al 89 days ago | 0 commentsNo comments yet