DesikRengarajan / EMRLDLinks
[NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments
☆12Updated 3 years ago
Alternatives and similar repositories for EMRLD
Users that are interested in EMRLD are comparing it to the libraries listed below
Sorting:
- ☆41Updated 3 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆193Updated last year
- There will be updates later☆85Updated 6 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆80Updated 2 years ago
- ☆43Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆53Updated 7 months ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆40Updated 5 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆39Updated 2 years ago
- ☆49Updated 4 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆104Updated 3 years ago
- ☆40Updated 3 years ago
- ☆39Updated 3 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆41Updated 11 months ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆73Updated 6 years ago
- PyTorch implementation of Constrained Policy Optimization☆55Updated 3 years ago
- Code for "ALMA: Hierarchical Learning for Composite Multi-Agent Tasks" NeurIPS 2022☆30Updated 3 years ago
- A collection of Meta-Reinforcement Learning algorithms in PyTorch☆50Updated last year
- ☆12Updated last year
- Robust and safe deep reinforcement learning algorithms☆15Updated last year
- This is the official implementation of Multi-Agent PPO.☆118Updated 2 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆61Updated 2 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆25Updated 2 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14Updated 4 years ago
- Single-file pytorch implementation of hybrid-SAC☆59Updated 4 years ago
- Source code for paper: Efficient deep reinforcement learning via adaptive policy transfer☆15Updated 3 years ago
- ☆20Updated 2 years ago
- ☆102Updated 3 years ago
- ☆50Updated 3 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆169Updated last year