Detecting misbehavior in frontier reasoning models | Dark Hacker News