Training a 70B Model from Scratch | Dark Hacker News