Show HN: Why Neural Networks Need He Init, Clipping, and Momentum | Dark Hacker News