LLM from scratch, part 29 – using DDP to train a base model in the cloud | Dark Hacker News