Not "it'll take a while." Impossible.
Today this is my Saturday.
My - - : autonomous hyperparameter mutation, parallel experiments, no babysitting.
Results: → 17 experiments, 0 crashes → baseline 1.2365 → . _ → 1.48% improvement. Found by the loop.
The chart shows the staircase, the same pattern Karpathy sees in his runs. His: 2 days, 276 experiments. Mine: 1 hour, 17. Same logic, different constraints.
The constraint here is the ' _=, which means ~5.5% MFU. The GPU is mostly waiting on memory transfers, not computing.
What the same loop looks like on different hardware: ( ): ~ / → .% : ~+ / → -% (): ~+ / → -% (): ~+ / → -%
Same autoresearch loop. Just more runway.
Karpathy's setup, 8x H100 running 48 hours would likely hit 5,000+ experiments. On 4090s that's not feasible. But 24-48 hours on what I have would still find significantly more than 1 hour did. That's what's running next.
This is my own multi-GPU implementation built on top of his single-GPU original, orchestrated with iii functions, workers, and triggers.
Claude Code made the extension possible in a weekend.
If you want to try: https://github.com/iii-hq/n-autoresearch open source repo.