ZifanWu / Coordinated-PPOLinks
Code accompanying paper "Coordinated Proximal Policy Optimization"
☆11Updated 3 years ago
Alternatives and similar repositories for Coordinated-PPO
Users that are interested in Coordinated-PPO are comparing it to the libraries listed below
Sorting:
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆193Updated last year
- ☆102Updated 3 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆116Updated 2 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆71Updated 3 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆80Updated 3 years ago
- This is the official implementation of Multi-Agent PPO.☆118Updated 2 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆61Updated 2 years ago
- pytorch实现的一些MARL算法☆68Updated 4 years ago
- an implementation of ATOC☆14Updated 3 years ago
- There will be updates later☆85Updated 6 years ago
- ☆43Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆53Updated 4 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆68Updated 2 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆142Updated last year
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆50Updated 5 years ago
- ☆39Updated 3 years ago
- ☆217Updated 2 years ago
- Implementation of PPO Lagrangian in PyTorch☆50Updated 3 years ago
- Code for Weighted QMIX☆140Updated 4 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆91Updated 10 months ago
- DSAC; Distributional Soft Actor-Critic☆132Updated 8 months ago
- Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation☆125Updated 3 months ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆53Updated 7 months ago
- Public implementation of "Multi-Agent Graph-Attention Communication and Teaming" from AAMAS'21☆85Updated last year
- I2Q: A Fully Decentralized Q-Learning Algorithm☆18Updated 2 years ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆73Updated 6 years ago
- ☆97Updated 4 years ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs☆38Updated 3 years ago
- Implementation of DyMA-CL, MARL algorithm☆28Updated 5 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆93Updated last year