Natural Emergent Misalignment from Reward Hacking in Production RL [pdf] | Dark Hacker News