DeepSpeed: Extreme-scale model training for everyone | Dark Hacker News