jianing-sun / Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gradient and Hindsight Experience Replay (HER)
☆37Updated 6 years ago
Alternatives and similar repositories for Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-:
Users that are interested in Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control- are comparing it to the libraries listed below
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)☆36Updated 4 years ago
- soft q learning and soft actor critic☆15Updated 6 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Updated 5 years ago
- Residual policy learning☆61Updated 6 years ago
- A library for building reinforcement learning and imitation learning agents in Pytorch☆58Updated 4 years ago
- Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro…☆154Updated 3 years ago
- ☆91Updated last year
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆55Updated 2 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Updated 5 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 5 years ago
- Library for model based RL in robotics☆37Updated 6 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆66Updated 5 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆56Updated 5 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆84Updated 6 years ago
- ☆84Updated 6 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆88Updated 6 years ago
- Code for "Divide-and-Conquer Reinforcement Learning"☆61Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆72Updated 7 years ago
- [ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration"☆47Updated 5 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆23Updated 5 years ago
- Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)☆33Updated 6 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆34Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 6 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning☆33Updated 7 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆32Updated 2 years ago
- 2D Gridworld navigation using RL with Hindsight Experience Replay☆45Updated 5 years ago