Two Leaps to 1000 Tokens/s on a 1T-Parameter Model | Dark Hacker News