Dark Hacker News
Evaluating Coding Agents with Terminal-Bench 2.0 (snorkel.ai)
2 points by vinhnx 11 days ago | 0 comments