tjuHaoXiaotian / MA-MuZero
MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampled-MuZero, from "Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces" at AAAI 2024.
☆15Updated 11 months ago
Alternatives and similar repositories for MA-MuZero:
Users interested in MA-MuZero are comparing it to the repositories listed below.
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆23Updated 8 months ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆26Updated last month
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆31Updated 10 months ago
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆38Updated 2 months ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆53Updated 7 months ago
- RLA is a tool for managing your RL experiments automatically☆25Updated last week
- (AAAI24 oral) Implementation of RPPO (Risk-sensitive PPO) and RPBT (Population-based self-play with RPPO)☆11Updated last year
- rlplot is an easy-to-use, highly encapsulated RL plotting library (including a basic error-bar line plot and a wrapper for "rliable").☆28Updated last year
- MATE: the Multi-Agent Tracking Environment.☆44Updated last year
- Google Research Football MARL Benchmark and Research Toolkit☆37Updated 8 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- [TNNLS] PGDQN: A generalized and efficient preference-guided epsilon-greedy policy equipped DQN for Atari and Autonomous Driving☆10Updated last year
- Implementation of SAC and TD3 based on various RNN and Transformer architectures.☆18Updated 3 months ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆29Updated 3 years ago
- Overcooked human-AI experiment platform☆32Updated last year
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆51Updated 2 weeks ago
- curriculum☆20Updated last year
- This repository provides a survey on the applications of deep generative models for offline reinforcement learning and imitation learning…☆38Updated 5 months ago
- Code for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning"☆16Updated last year
- Code for ROMANCE☆13Updated 3 months ago