Dark Hacker News
new
|
best
|
ask
|
show
|
jobs
Evals in 2025: going beyond simple benchmarks to build models people can use
(github.com)
80 points
by
jxmorris12
28 days ago
| 8 comments
Evals in 2025: going beyond simple benchmarks to build models people can use | Dark Hacker News