Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.
☆43May 8, 2024Updated 2 years ago
Alternatives and similar repositories for MAZero
Users that are interested in MAZero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML' 24] The PyTorch implementation of our paper: "Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforc…☆24May 29, 2024Updated last year
- Code for ICML25 paper "HYGMA: Hypergraph Coordination Networks with Dynamic Grouping for Multi-Agent Reinforcement Learning"☆24Nov 11, 2025Updated 5 months ago
- ☆13Jan 17, 2022Updated 4 years ago
- LAMARL: LLM-Aided Multi-Agent Reinforcement Learning for Cooperative Policy Generation☆49Jul 19, 2025Updated 9 months ago
- A plotter for reinforcement learning (RL) using Weights & Biases☆15Dec 20, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [NeurIPS 2024] Doubly Mild Generalization for Offline Reinforcement Learning☆16Oct 29, 2025Updated 6 months ago
- ☆20Aug 15, 2023Updated 2 years ago
- A pytorch implementation of Dreamer☆24Mar 13, 2023Updated 3 years ago
- Safe Model-Based RL HVAC Control Using Epistemic Uncertainty Estimation.☆12Feb 25, 2025Updated last year
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆42Jul 24, 2025Updated 9 months ago
- (Official) PyTorch implementation for Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning (EMU) (ICLR…☆56May 23, 2024Updated last year
- Official repository for our paper on "Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models"☆13Dec 4, 2023Updated 2 years ago
- Official implementation of HARL algorithms based on PyTorch.☆900Apr 27, 2025Updated last year
- ADP☆12Apr 12, 2017Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks☆53Apr 29, 2026Updated last week
- ☆23Dec 22, 2024Updated last year
- We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effe…☆25Feb 10, 2024Updated 2 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆59Dec 27, 2023Updated 2 years ago
- Offical code for Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning☆30Sep 1, 2024Updated last year
- [ICML'23] Official PyTorch Implementation of NA2Q, and a comprehensive benchmark based on pymarl☆22Jan 14, 2024Updated 2 years ago
- Automated and fast parsing of local project directories and GitHub directories, one-click deployment of local parsing with AutoGPT(自动化快速解…☆26Sep 5, 2024Updated last year
- [ICLR'26] MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆45Apr 17, 2026Updated 3 weeks ago
- This is a code demo for the paper "Few-shot Hyperspectral Image Classification with Self-supervised Learning"☆23Oct 1, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆155Apr 24, 2025Updated last year
- Implementation of Continuous Control RL Algorithms☆11Dec 8, 2022Updated 3 years ago
- Code for the paper "Collaborative Target Search with a Visual Drone Swarm: Adaptive Curriculum Embedded Multistage Reinforcement Learnin…☆58Nov 25, 2023Updated 2 years ago
- [CVPR 2023] PartManip: Learning Cross-Category Generalizable Part Manipulation Policy from Point Cloud Observations☆52Oct 25, 2024Updated last year
- [ICML 2025] Official Implementation of GLIDER☆74Oct 9, 2025Updated 6 months ago
- ☆35Apr 11, 2023Updated 3 years ago
- ☆13May 10, 2021Updated 4 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆32Nov 22, 2025Updated 5 months ago
- [ICDE 2023] Exploring both Individuality and Cooperation for Air-Ground Spatial Crowdsourcing by Multi-Agent Deep Reinforcement Learning☆27Nov 30, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆27Feb 24, 2024Updated 2 years ago
- Cooperation and Fairness in Multi-Agent Reinforcement Learning☆16Aug 6, 2025Updated 9 months ago
- ZJU Robotics project of differential drive car path planning and trajectory planning based on the Client simulation platform (my freshman…☆10Dec 2, 2020Updated 5 years ago
- ICLR'2024: Learning Multi-Agent Communication from Graph Modeling Perspective☆57Mar 11, 2024Updated 2 years ago
- Code for Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken, ECCV 2024☆26Jul 14, 2024Updated last year
- Aerial Combat environment build around PyFlyt☆12Aug 12, 2023Updated 2 years ago
- Harnessing reinforcement learning, this repository emulates drone flocking behavior inspired by biological models. This uses a 2D environ…☆11Nov 4, 2023Updated 2 years ago