vub-ai-lab / bdpi
Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
☆25Updated 5 years ago
Related projects: ⓘ
- Code for "Divide-and-Conquer Reinforcement Learning"☆60Updated 5 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆33Updated last year
- ☆26Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆28Updated 5 years ago
- Autoregressive policies for continuous control reinforcement learning☆28Updated 5 years ago
- This repository contains the code used in the paper Evaluating the Performance of Reinformcent Learning Algorithms☆27Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆30Updated 4 years ago
- Inferring beliefs about dynamics from behavior☆28Updated 6 years ago
- ☆54Updated 6 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54Updated 5 years ago
- ☆46Updated 4 years ago
- Efficient Exploration via State Marginal Matching (2019)☆66Updated 5 years ago
- ☆35Updated 6 years ago
- Guided-Meta Policy Search☆41Updated last year
- Disagreement-Regularized Imitation Learning☆30Updated 3 years ago
- Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning☆27Updated 4 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14Updated 6 years ago
- ☆35Updated this week
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆22Updated 5 years ago
- ☆53Updated 2 years ago
- Convert DeepMind Control Suite to OpenAI gym environments.☆83Updated 4 years ago
- Code for the Reset-free Trial and Error learning paper (RTE) experiments☆10Updated 6 years ago
- ☆15Updated 3 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆24Updated 3 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆77Updated 5 years ago
- Gym environments for Robots that learn to interact with the environment autonomously☆34Updated last year
- ☆34Updated 5 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 4 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆44Updated last year