No Train No Gain: Revisiting Efficient Training Algorithms for Transformer-Based Language Models (arxiv.org) | 11 points by froster 2 years ago | 1 comment