Proximal Policy Optimization