jbuckman / dmdp-donutworldLinks
☆13Updated 6 years ago
Alternatives and similar repositories for dmdp-donutworld
Users that are interested in dmdp-donutworld are comparing it to the libraries listed below
Sorting:
- ☆99Updated 2 years ago
- Proximal Policy Option-Critic☆26Updated 7 years ago
- ☆92Updated 2 years ago
- Deep Variational Reinforcement Learning☆139Updated 3 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆154Updated 5 years ago
- ☆31Updated 6 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆87Updated 7 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆74Updated 8 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆246Updated 3 years ago
- NeurIPS Reproducibility Challenge 2019☆20Updated 5 years ago
- Hindsight policy gradients☆46Updated 5 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆34Updated 5 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Updated 4 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning☆94Updated 7 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Updated 6 years ago
- ☆44Updated 7 years ago
- Implementation of the Option-Critic Architecture on the Atari (ALE) environment☆182Updated 8 years ago
- ☆62Updated 7 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆198Updated 2 years ago
- ☆72Updated 6 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆22Updated 6 years ago
- Code to train RL agents along with Adversarial distrubance agents☆66Updated 8 years ago
- Implementation of the Option-Critic Architecture☆40Updated 7 years ago
- A3C style Option-Critic with deliberation cost☆40Updated 8 years ago
- ☆114Updated 2 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆68Updated 5 years ago
- Gym environments modified with adversarial agents☆36Updated 8 years ago
- Building Agents with Imagination: pytorch step-by-step implementation☆209Updated 6 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆65Updated 6 years ago
- Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762)☆101Updated 5 years ago