tjuHaoXiaotian / MA-MuZero
MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampled-MuZero, from "Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces" at AAAI 2024.
☆17Updated last year
Alternatives and similar repositories for MA-MuZero:
Users that are interested in MA-MuZero are comparing it to the libraries listed below
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆26Updated 10 months ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆29Updated 3 months ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆19Updated 6 months ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆33Updated last year
- ☆29Updated last year
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆65Updated 2 months ago
- MATE: the Multi-Agent Tracking Environment.☆37Updated 2 years ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆19Updated 3 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- curriculum☆22Updated 2 years ago
- MATE: the Multi-Agent Tracking Environment.☆44Updated 2 years ago
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆40Updated 5 months ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆34Updated last month
- ☆23Updated 5 months ago
- ☆29Updated 2 years ago
- Google Research Football MARL Benchmark and Research Toolkit☆40Updated 10 months ago
- RLA is a tool for managing your RL experiments automatically☆27Updated 2 months ago
- ☆91Updated last year
- ☆61Updated 4 months ago
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆23Updated 4 months ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆52Updated 2 years ago
- ☆43Updated 2 years ago
- ☆30Updated last year
- [ICML' 24] The PyTorch implementation of our paper: "Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforc…☆16Updated 10 months ago
- ☆20Updated 7 months ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3☆31Updated last year
- ☆13Updated 4 months ago
- ☆25Updated last year
- Model-based Offline Policy Optimization re-implement all by pytorch☆31Updated last year