Does RL Incentivize Reasoning in LLMs Beyond the Base Model? | Dark Hacker News