Scaling pretraining affects RL sample efficiency | Dark Hacker News