LLM Speedrunner: Eval for frontier models to reproduce scientific findings | Dark Hacker News