lerrel / rllab-adv
Code to train RL agents along with Adversarial distrubance agents
☆63Updated 7 years ago
Related projects: ⓘ
- Gym environments modified with adversarial agents☆35Updated 7 years ago
- Deep Variational Reinforcement Learning☆132Updated 2 years ago
- ☆95Updated last year
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆65Updated 4 years ago
- ☆90Updated 9 months ago
- Efficient Exploration via State Marginal Matching (2019)☆66Updated 5 years ago
- ☆42Updated 7 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆86Updated 6 years ago
- ☆30Updated 10 months ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆77Updated 5 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- Disagreement-Regularized Imitation Learning☆30Updated 3 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆85Updated 5 years ago
- ☆59Updated 6 years ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆37Updated last year
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆92Updated 2 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆47Updated 2 years ago
- ☆65Updated 6 months ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- ☆41Updated 5 years ago
- A library of probabilistic model based RL algorithms in pytorch☆106Updated 3 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆65Updated 4 years ago
- ☆79Updated 3 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 5 years ago
- Hindsight policy gradients☆42Updated 4 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 6 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆21Updated 4 years ago
- Code for "Divide-and-Conquer Reinforcement Learning"☆60Updated 5 years ago
- Hierarchical Deep RL Network☆29Updated 7 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆25Updated 4 years ago