Writing an LLM from scratch, part 32d – Interventions: adding attention bias | Dark Hacker News