Bandits and Reinforcement Learning (Fall 2017) | Dark Hacker News