Reinforcement Learning from Human Feedback: When the Math Ain't Enough | Dark Hacker News