[Research] Reinforcement Learning with A* and a Deep Heuristic | Dark Hacker News