UnrealTracking / ToM2C
The official implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022).
☆54Updated last year
Related projects:
- ☆39Updated 3 years ago
- The implementation of AAAI'22 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".☆48Updated 9 months ago
- ☆36Updated 2 years ago
- Public implementation of "Multi-Agent Graph-Attention Communication and Teaming" from AAMAS'21☆72Updated 4 months ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆83Updated last year
- The code for the AAMAS 2022 paper "GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning"☆38Updated 2 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆91Updated 2 years ago
- ☆87Updated 2 years ago
- ☆87Updated 3 years ago
- PyTorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"☆50Updated last year
- Deep Implicit Coordination Graphs☆40Updated 3 months ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆61Updated 3 years ago
- ☆40Updated 3 years ago
- This is the official implementation of Multi-Agent PPO.☆89Updated last year
- Code for the paper "Delay-Aware Multi-Agent Reinforcement Learning".☆49Updated 4 years ago
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆68Updated 10 months ago
- The code for the paper "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆36Updated last year
- ☆44Updated 3 years ago
- ☆28Updated 3 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆39Updated last year
- A PyTorch implementation of Multi-Agent Soft Actor-Critic☆34Updated 5 years ago
- The official code release for publications in the MARL field from the TJU RL lab.☆50Updated 2 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆52Updated 4 years ago
- ☆81Updated 2 years ago
- ☆16Updated last year
- There will be updates later☆79Updated 5 years ago
- Code for Weighted QMIX☆119Updated 3 years ago
- PyTorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"☆25Updated 3 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning☆54Updated 2 years ago
- Implementation of DyMA-CL, MARL algorithm☆22Updated 4 years ago