TournO: Tournament Optimization for Non-Verifiable RL | Dark Hacker News