Why Run RL? How specialized models can outperform the biggest LLMs | Dark Hacker News