Show HN: RewardHackBench: Using sandboxes to stop agents from cheating | Dark Hacker News