A Bayesian Perspective on Q-Learning | Dark Hacker News