Confessions can keep language models honest | Dark Hacker News