tjuHaoXiaotian / MA-MuZero
MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampled-MuZero, from "Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces" at AAAI 2024.
☆14Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for MA-MuZero
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆30Updated 8 months ago
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆16Updated 6 months ago
- RLA is a tool for managing your RL experiments automatically☆25Updated 3 months ago
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆44Updated 3 weeks ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆13Updated last month
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆51Updated 5 months ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆26Updated last year
- ☆17Updated last month
- Google Research Football MARL Benchmark and Research Toolkit☆34Updated 6 months ago
- ☆25Updated 7 months ago
- ☆28Updated last year
- Overcooked human-AI experiment platform☆30Updated 11 months ago
- ☆40Updated 2 years ago
- MATE: the Multi-Agent Tracking Environment.☆43Updated last year
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆38Updated 3 weeks ago
- curriculum☆20Updated last year
- MATE: the Multi-Agent Tracking Environment.☆33Updated last year
- The implementation of IJCAI'22 paper "Multi-Agent Concentrative Coordination with Decentralized Task Representation".☆16Updated 2 years ago
- ☆15Updated 3 months ago
- ☆11Updated 9 months ago
- rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable").☆26Updated 11 months ago
- ☆28Updated 3 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆73Updated 11 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"