jianing-sun / Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-

Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gradient and Hindsight Experience Replay (HER)
36Updated 5 years ago

Related projects

Alternatives and complementary repositories for Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-