The theory of Proximal Policy Optimisation implementations | Dark Hacker News