Writing an LLM from scratch, part 32g – Interventions: weight tying | Dark Hacker News