abhisheknaik96 / continuing-rl-exps
Code for running RL experiments on continuing (non-episodic) problems.
☆17Updated this week
Alternatives and similar repositories for continuing-rl-exps:
Users that are interested in continuing-rl-exps are comparing it to the libraries listed below
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆107Updated 3 years ago
- ☆102Updated 2 months ago
- Constrained Policy Optimization implementation on Safety Gym☆25Updated 3 years ago
- ☆42Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆125Updated 2 months ago
- A collection of offline reinforcement learning algorithms.☆178Updated 5 months ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆154Updated last year
- ☆96Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆50Updated 2 months ago
- ☆16Updated 2 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆73Updated last week
- 🚀 A fast safe reinforcement learning library in PyTorch☆185Updated 6 months ago
- ☆93Updated 4 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆166Updated last year
- ☆59Updated 4 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆127Updated last year
- There will be updates later☆84Updated 5 years ago
- ☆29Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆166Updated last year
- This is the official implementation of Multi-Agent PPO.☆104Updated 2 years ago
- Implement reinforcement learning algorithms in Pytorch☆33Updated 3 years ago
- PyTorch implementation of Constrained Policy Optimization☆53Updated 3 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆74Updated last year
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆54Updated last year
- I2Q: A Fully Decentralized Q-Learning Algorithm☆18Updated 2 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆59Updated 4 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆65Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆61Updated last year