Writing an LLM from scratch, part 32B – Interventions: gradient clipping | Dark Hacker News