No Train No Gain:Revisiting Efficient Training Algrthm for Transformer-BasedLM | Dark Hacker News