Kimi introduces Attention Residuals: 1.25x compute performance at <2% overhead | Dark Hacker News