Explaining Reinforcement Learning with Human Feedback (RLHF) | Dark Hacker News