AlphaSnake: Policy Iteration on a Nondeterministic NP-Hard MDP(arxiv.org)1 points by kenny239 317 days ago | 0 comments