AlphaSnake: Policy Iteration on a Nondeterministic NP-Hard MDP(arxiv.org)1 points by kenny239 271 days ago | 0 comments