How RLHF Preference Model Tuning Works (and How Things May Go Wrong) | Dark Hacker News