ubiquition / drl
☆23Updated last year
Alternatives and similar repositories for drl:
Users who are interested in drl are comparing it to the libraries listed below.
- ☆102Updated last week
- ☆37Updated 2 months ago
- DSAC; Distributional Soft Actor-Critic☆123Updated last week
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆49Updated 4 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆84Updated last year
- A clean and robust PyTorch implementation of SAC on continuous action space☆66Updated 8 months ago
- ☆94Updated 3 years ago
- ☆70Updated last year
- ☆54Updated 3 weeks ago
- Model-Free Safe Reinforcement Learning through Neural Barrier Certificate☆32Updated 9 months ago
- PyTorch implementation of Constrained Policy Optimization☆51Updated 3 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆99Updated 3 years ago
- A clean and robust PyTorch implementation of SAC on discrete action space☆34Updated 3 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆161Updated 10 months ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆149Updated last year
- A PyTorch implementation of Multi-Agent Soft Actor-Critic☆38Updated 6 years ago
- Implementation of PPO Lagrangian in PyTorch☆36Updated 2 years ago
- ☆16Updated 2 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆172Updated 4 months ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) on the Pyramids environment, Unity ML☆14Updated last year
- Reinforcement learning algorithm for mapless navigation☆64Updated 3 years ago
- ☆41Updated 3 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆55Updated 8 months ago
- ☆38Updated 2 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆164Updated last year
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆118Updated 10 months ago
- A collection of recent MARL papers☆85Updated 3 months ago
- ☆26Updated last month
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆58Updated last year
- ☆23Updated 5 years ago