lerrel / gym-adv
Gym environments modified with adversarial agents
☆36Updated 8 years ago
Alternatives and similar repositories for gym-adv:
Users who are interested in gym-adv are comparing it to the libraries listed below:
- Code to train RL agents along with adversarial disturbance agents☆64Updated 8 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- ☆60Updated 6 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆71Updated 2 years ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆45Updated last year
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆28Updated 5 years ago
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆94Updated 2 years ago
- ☆91Updated last year
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆22Updated 5 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26Updated 4 years ago
- Adversarial Imitation Via Variational Inverse Reinforcement Learning☆95Updated 5 years ago
- ☆42Updated 6 years ago
- ☆98Updated 2 years ago
- ☆75Updated 10 months ago
- Disagreement-Regularized Imitation Learning☆30Updated 3 years ago
- An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch☆24Updated 5 years ago
- ☆53Updated last year
- NeurIPS Reproducibility Challenge 2019☆20Updated 5 years ago
- ☆66Updated 4 years ago
- Implementation of the Option-Critic Architecture☆39Updated 6 years ago
- ☆84Updated 6 years ago
- ☆83Updated 4 years ago
- Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)☆33Updated 6 years ago
- A TensorFlow implementation of the Option-Critic Architecture☆72Updated 7 years ago
- Model-Based Offline Reinforcement Learning☆50Updated 4 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆49Updated 3 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆66Updated last year
- Simple maze environments using mujoco-py☆54Updated last year
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Updated 4 years ago