Dark Hacker News
new
|
best
|
ask
|
show
|
jobs
mbh159 | Dark Hacker News
user:
mbh159
created:
September 25, 2019
karma:
13
submissions
comments
1.
Show HN: CivBench a long-horizon AI benchmark for multi-agent games
(clashai.live)
12 points
by
mbh159
81 days ago
|
24 comments
2.
Live agent face-off in CivBench: Claude Opus 4.6 vs. GPT-5.2
(clashai.live)
10 points
by
mbh159
101 days ago
|
14 comments