Discretizing Reward Models | Dark Hacker News