ZifanWu / Coordinated-PPO
Code accompanying paper "Coordinated Proximal Policy Optimization"
☆11Updated 3 years ago
Alternatives and similar repositories for Coordinated-PPO
Users that are interested in Coordinated-PPO are comparing it to the libraries listed below
Sorting:
- Implementation of DyMA-CL, MARL algorithm☆27Updated 5 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆65Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆51Updated 3 years ago
- jinxinglim / Game-Theoretical-Approaches-in-Multi-Agent-Reinforcement-Learning-Policy-Space-Response-Oracles☆14Updated 5 years ago
- Implementation of PPO Lagrangian in PyTorch☆44Updated 2 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆77Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆170Updated last year
- Code for Weighted QMIX☆136Updated 4 years ago
- ☆96Updated 3 years ago
- There will be updates later☆84Updated 6 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆40Updated 6 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆54Updated last year
- Public implementation of "Multi-Agent Graph-Attention Communication and Teaming" from AAMAS'21☆84Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆85Updated 2 years ago
- Generate expert demonstrations; GAIL(Generative Adversarial Imitation Learning); IRL(Inverse Reinforcement Learning)☆33Updated 3 years ago
- ☆72Updated last year
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning☆14Updated 5 years ago
- I2Q: A Fully Decentralized Q-Learning Algorithm☆18Updated 2 years ago
- pytorch实现的一些MARL算法☆65Updated 4 years ago
- ☆42Updated 3 years ago
- Code for paper Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety.☆20Updated 2 years ago
- PyTorch implementation of Constrained Policy Optimization☆54Updated 3 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆88Updated last year
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆61Updated last year
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 2 months ago
- ☆39Updated 2 years ago
- Model-Free Safe Reinforcement Learning through Neural Barrier Certificate☆33Updated last year
- The implement of the policy gradient RL algorithm with pytorch☆38Updated 4 years ago
- an implementation of ATOC☆14Updated 3 years ago
- Constrained Policy Optimization implementation on Safety Gym☆27Updated 3 years ago