AlphaSnake: Policy Iteration on a Nondeterministic NP-Hard MDP | Dark Hacker News