How LLMs learn to reason: A deep dive into post-training strategies | Dark Hacker News