Speed up Megatron-LM by 10% to 30% with Zero Bubble Pipeline Parallelism | Dark Hacker News