Understanding Attention Residuals | Dark Hacker News