abhisheknaik96 / continuing-rl-expsLinks
Code for running RL experiments on continuing (non-episodic) problems.
☆17Updated 2 weeks ago
Alternatives and similar repositories for continuing-rl-exps
Users that are interested in continuing-rl-exps are comparing it to the libraries listed below
Sorting:
- ☆41Updated 3 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆77Updated last month
- This is the official implementation of Multi-Agent PPO.☆106Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆156Updated last year
- Transformer in RL for decision-making☆97Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆110Updated 4 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- ☆96Updated 3 years ago
- Constrained Policy Optimization implementation on Safety Gym☆27Updated 3 years ago
- ☆61Updated 4 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆79Updated 2 years ago
- A collection of offline reinforcement learning algorithms.☆185Updated 6 months ago
- I2Q: A Fully Decentralized Q-Learning Algorithm☆18Updated 2 years ago
- ☆103Updated 3 months ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆86Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 3 months ago
- DSAC; Distributional Soft Actor-Critic☆127Updated 3 months ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆87Updated last year
- ☆63Updated 3 weeks ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆70Updated 3 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆137Updated last year
- MATE: the Multi-Agent Tracking Environment.☆44Updated 2 years ago
- ☆204Updated 2 years ago
- Codes of GoMARL accompanying the paper "Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning"(NeurIPS 2023). G…☆28Updated 9 months ago
- PyTorch implementation of Constrained Policy Optimization☆54Updated 3 years ago
- ☆39Updated 2 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆58Updated 5 years ago
- TD3 in Pytorch☆34Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆174Updated last year