Scaling Reinforcement Learning for Trillion-Scale Thinking Model | Dark Hacker News