tjuHaoXiaotian / MA-MuZero
MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampled-MuZero, from "Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces" at AAAI 2024.
☆18Updated last year
Alternatives and similar repositories for MA-MuZero:
Users that are interested in MA-MuZero are comparing it to the libraries listed below
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆27Updated 11 months ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆29Updated 4 months ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆34Updated last year
- ☆13Updated 4 months ago
- ☆23Updated 6 months ago
- ☆29Updated last year
- ☆21Updated 8 months ago
- RLA is a tool for managing your RL experiments automatically☆28Updated 3 months ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆19Updated 3 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- Implementation of SAC and TD3 based on various RNN and Transformer.☆21Updated 6 months ago
- ☆61Updated 5 months ago
- Overcooked human-AI experiment platform☆37Updated last year
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆40Updated 5 months ago
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"☆25Updated 9 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆82Updated last year
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆68Updated 3 months ago
- ☆12Updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆53Updated 2 years ago
- MATE: the Multi-Agent Tracking Environment.☆36Updated 2 years ago
- rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable").☆31Updated last year
- Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3☆31Updated last year
- ☆21Updated last year
- MATE: the Multi-Agent Tracking Environment.☆44Updated 2 years ago
- ☆96Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Updated 2 years ago
- ☆11Updated last year
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆21Updated 3 years ago
- ☆23Updated last year