jianing-sun / Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gradient and Hindsight Experience Replay (HER)
☆35Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)☆36Updated 3 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆48Updated 5 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆66Updated 4 years ago
- soft q learning and soft actor critic☆15Updated 5 years ago
- Implementation of Linear Inverse Reinforcement Learning Algorithm (IRL) on Mountain Car Environment.☆29Updated 4 years ago
- A curated list of awesome Model-based reinforcement learning resources☆90Updated 4 years ago
- 2D Gridworld navigation using RL with Hindsight Experience Replay☆43Updated 5 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆82Updated 6 years ago
- Code for "Divide-and-Conquer Reinforcement Learning"☆60Updated 5 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆53Updated last year
- Residual policy learning☆58Updated 5 years ago
- ☆90Updated 11 months ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆88Updated 5 years ago
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Updated 5 years ago
- Library for model based RL in robotics☆37Updated 6 years ago
- Unified Model-Free Hierarchical Reinforcement Learning Framework☆37Updated 5 years ago
- Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)☆33Updated 5 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆70Updated 7 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- ☆82Updated 5 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆21Updated 4 years ago
- Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro…☆152Updated 3 years ago
- Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ…☆26Updated last year
- Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action☆112Updated 5 years ago
- accompanying code for neurips submission "Goal-conditioned Imitation Learning"☆67Updated last year
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆41Updated last year
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆23Updated 5 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- A library for building reinforcement learning and imitation learning agents in Pytorch☆58Updated 4 years ago