LLM from scratch, part 18 – residuals, shortcut connections, and the Talmud | Dark Hacker News