tjuHaoXiaotian/MA-MuZero

MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampled-MuZero, from "Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces" at AAAI 2024.

☆23

Alternatives and similar repositories for MA-MuZero

Users who are interested in MA-MuZero are comparing it to the repositories listed below.

xihuai18 / A2PO-ICLR2023
Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)
☆32 · Nov 22, 2025 · Updated 8 months ago
tjuHaoXiaotian / Qfamily_for_MatrixGame
We provide a very simple implementation of the typical value decomposition methods for solving single state Matrix Games.
☆16 · Jul 18, 2022 · Updated 4 years ago
polixir / RLAssistant
RLA is a tool for managing your RL experiments automatically
☆31 · Jan 11, 2025 · Updated last year
mas-group / jshop2
JSHOP2 planner with support for JIT execution of plans. This is a fork of the original JSHOP2 planner that can be found at https://source…
☆23 · Aug 17, 2015 · Updated 10 years ago
tjuHaoXiaotian / ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
☆27 · Aug 12, 2020 · Updated 5 years ago
tjuHaoXiaotian / SC1
☆24 · Sep 11, 2018 · Updated 7 years ago
wkh923 / m3pc
M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025
☆19 · Mar 17, 2025 · Updated last year
Aaron617 / text2world
[ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation
☆29 · Feb 25, 2025 · Updated last year
vint-1 / dreamsmooth
DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)
☆12 · May 6, 2024 · Updated 2 years ago
keep9oing / Sequential-Greedy-Algorithm
Sequential greedy algorithm for multi-robot task allocation
☆20 · Jun 1, 2022 · Updated 4 years ago
tjuHaoXiaotian / GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
☆32 · Oct 9, 2018 · Updated 7 years ago
benellis3 / mappo
☆18 · Aug 14, 2023 · Updated 2 years ago
PdIPS / ConsensusBasedX.jl
A Julia package for consensus-based optimisation
☆16 · Jul 13, 2026 · Updated 2 weeks ago
lio-wong / llm-operators
☆11 · Oct 29, 2024 · Updated last year
Tlntin / booking_simulator
☆11 · Jan 6, 2024 · Updated 2 years ago
spfrommer / flowmatching_policy_rl
☆23 · Jul 22, 2025 · Updated last year
YashBansod / IPyHOP
IPyHOP is a Re-entrant Iterative GTPyHOP written in Python 3. PyHOP is an acronym for Python Hierarchical Ordered Planner.
☆12 · Aug 12, 2022 · Updated 3 years ago
hilookas / astra_ws
☆25 · Feb 21, 2026 · Updated 5 months ago
valeriechen / ask-your-humans
Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"
☆11 · Apr 8, 2025 · Updated last year
Dawn0523 / LAIES
☆18 · Jul 14, 2023 · Updated 3 years ago
Sphere-AI-Lab / poet
Implementation for POET and POET-X for LLM pretraining
☆38 · Jun 9, 2026 · Updated last month
S0daWh1skey / SmartCar-STM32
Smart line-tracking, obstacle-avoiding car based on the STM32
☆13 · Jul 4, 2018 · Updated 8 years ago
anki08 / Option-Critic
A simple option critic framework using Q-Learning
☆14 · Feb 7, 2022 · Updated 4 years ago
eyounx / PRR
Meta-Reinforcement Learning with Policy Residual Representation
☆11 · Aug 15, 2019 · Updated 6 years ago
menggedu / EDL
Code and data for the paper "Large language models for automatic equation discovery of nonlinear dynamics"
☆14 · Mar 6, 2025 · Updated last year
BorealisAI / llm-pddl-planning
☆18 · Feb 20, 2025 · Updated last year
HMS-IDAC / UnMicst
UNet script, model, sample data
☆14 · Feb 19, 2025 · Updated last year
thethaibinh / agile_flight
Simulation system for path planning evaluation
☆13 · Dec 13, 2025 · Updated 7 months ago
ASU-VDA-Lab / ECO-CHIP
☆11 · Mar 3, 2025 · Updated last year
zhaoyizhou1123 / mbrcsl
☆11 · Nov 18, 2023 · Updated 2 years ago
yeshenpy / EvoRainbow
(ICML 2024) The official code for EvoRainbow: Combining Improvements in Evolutionary Reinforcement Learning for Policy Search
☆38 · Feb 3, 2026 · Updated 5 months ago
ChengpengLi1003 / Q-learning
A reimplementation of the classic tabular Q-learning algorithm that supports most gym environments with discrete action and state spaces, such as CliffWalking-v0.
☆10 · Jan 2, 2021 · Updated 5 years ago
erikon / reinforcement-learning
CS 188 Project 3
☆11 · Mar 5, 2018 · Updated 8 years ago
daomingAU / MontezumaRevenge_SDRL
☆17 · Feb 25, 2020 · Updated 6 years ago
ConesaLab / MOSim
Bulk and single-cell Multi-Omics ground truth Simulator in R
☆12 · Feb 10, 2026 · Updated 5 months ago
JINAN-xxx / gym_super_mario
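This project explores applying reinforcement learning to the classic game Super Mario, training an intelligent agent to navigate and complete game levels autonomously. We use advanced reinforcement learning algorithms such as Deep Q-Network (DQN) and Double Deep Q-Network (DDQN), combined with neural networks, so that the agent learns how to survive in the game world and achieve high scores. Project features: reinforcement learning in practice: this…
☆18 · Mar 25, 2026 · Updated 4 months ago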
VAMPIR-Lab / Interstate.jl
A lightweight driving simulator, written in Julia.
☆19 · Sep 25, 2024 · Updated last year
shariqiqbal2810 / Multi-Explore
Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"
☆37 · May 22, 2021 · Updated 5 years ago
TimeBreaker / Adversarial-Reinforcement-Learning-Papers
Adversarial Reinforcement Learning papers (single-agent setting and multi-agent setting)
☆76 · Jul 10, 2026 · Updated 2 weeks ago