Writing an LLM from scratch, part 32a – Interventions: training a baseline model | Dark Hacker News