alexanderbaumann99 / PPO-Algorithms

Experiments of the three PPO-Algorithms (PPO, clipped PPO, PPO with KL-penalty) proposed by John Schulman et al. on the 'Cartpole-v1' environment.
12Updated 3 years ago

Related projects

Alternatives and complementary repositories for PPO-Algorithms