vub-ai-lab / bdpi
Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
☆25Updated 5 years ago
Alternatives and similar repositories for bdpi
Users who are interested in bdpi are comparing it to the libraries listed below
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 6 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- ☆54Updated 7 years ago
- ☆55Updated 2 years ago
- ☆35Updated 6 years ago
- Code for "Divide-and-Conquer Reinforcement Learning"☆61Updated 6 years ago
- ☆25Updated 6 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Inferring beliefs about dynamics from behavior☆29Updated 6 years ago
- Disagreement-Regularized Imitation Learning☆30Updated 3 years ago
- Efficient Exploration via State Marginal Matching (2019)☆68Updated 5 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Updated 5 years ago
- Implementation of "Learning Continuous Control Policies by Stochastic Value Gradients" (https://arxiv.org/abs/1510.09142)☆26Updated 3 years ago
- Convert DeepMind Control Suite to OpenAI gym environments.☆86Updated 5 years ago
- Distributed DDPG implementation in PyTorch☆9Updated 6 years ago
- Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…☆32Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 6 years ago
- ☆13Updated 7 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- ☆28Updated 4 years ago
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14Updated 7 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 5 years ago
- ☆31Updated 6 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- Leave No Trace is an algorithm for safe reinforcement learning.☆15Updated 7 years ago
- ☆68Updated 3 years ago
- This repository contains the code used in the paper Evaluating the Performance of Reinforcement Learning Algorithms☆27Updated 3 years ago