Avatarl: Training language models from scratch with pure reinforcement learning(tokenbender.com)9 points by Gusarich 282 days ago | 0 commentsNo comments yet