kaesve/muzero

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kaesve/muzero)

kaesve / muzero

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

☆169

Alternatives and similar repositories for muzero

Users that are interested in muzero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

werner-duvaud / muzero-general
View on GitHub
MuZero
☆2,844Sep 3, 2024Updated last year
johan-gras / MuZero
View on GitHub
A structured implementation of MuZero
☆205Jun 4, 2022Updated 4 years ago
koulanurag / muzero-pytorch
View on GitHub
Pytorch Implementation of MuZero
☆356Jul 23, 2023Updated 3 years ago
fidel-schaposnik / muzero
View on GitHub
Tensorflow implementation of MuZero algorithm
☆11Aug 23, 2022Updated 3 years ago
hr0nix / omega
View on GitHub
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆44Sep 19, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Zeta36 / muzero
View on GitHub
A simple implementation of MuZero algorithm for connect4 game
☆96Aug 11, 2020Updated 5 years ago
YeWR / EfficientZero
View on GitHub
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
☆938Dec 20, 2023Updated 2 years ago
pikaju / muzero-g
View on GitHub
Reference implementation for the paper titled "Improving Model-Based Reinforcement Learning with Internal State Representations through S…
☆12Feb 10, 2021Updated 5 years ago
tuero / muzero-cpp
View on GitHub
A C++ pytorch implementation of MuZero
☆40May 18, 2026Updated 2 months ago
Hwhitetooth / jax_muzero
View on GitHub
An implementation of MuZero in JAX.
☆58Nov 8, 2022Updated 3 years ago
mabirck / AttentionTRL
View on GitHub
Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind
☆10Jan 9, 2018Updated 8 years ago
DHDev0 / Stochastic-muzero
View on GitHub
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…
☆79Dec 31, 2025Updated 6 months ago
mila-iqia / spr
View on GitHub
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
☆167Dec 21, 2021Updated 4 years ago
mlpc-ucsd / XTRA
View on GitHub
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
☆16Apr 30, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
diogohmcruz / DeepDip
View on GitHub
DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA
☆13Jul 22, 2019Updated 7 years ago
facebookresearch / rebel
View on GitHub
An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.
☆700Mar 20, 2024Updated 2 years ago
sotetsuk / pgx
View on GitHub
♟️ Vectorized RL game environments in JAX
☆634Mar 6, 2025Updated last year
Shengjiewang-Jason / EfficientZeroV2
View on GitHub
[ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
☆121Aug 9, 2024Updated last year
opendilab / LightZero
View on GitHub
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…
☆1,626Jul 17, 2026Updated last week
google-deepmind / dqn_zoo
View on GitHub
DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…
☆509Jul 20, 2026Updated last week
RobertTLange / gymnax-blines
View on GitHub
Baselines for gymnax 🤖
☆78Apr 3, 2023Updated 3 years ago
sail-sg / rosmo
View on GitHub
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
☆30Jul 18, 2023Updated 3 years ago
rlglab / minizero
View on GitHub
[IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework
☆137Jul 17, 2026Updated last week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ztjhz / t5-jax
View on GitHub
JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
☆24Jun 10, 2023Updated 3 years ago
uber-research / Evolvability-ES
View on GitHub
☆14Jun 26, 2019Updated 7 years ago
tmoer / a0c
View on GitHub
Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)
☆15Jan 19, 2021Updated 5 years ago
google-deepmind / mctx
View on GitHub
Monte Carlo tree search in JAX
☆2,649Jul 9, 2026Updated 2 weeks ago
kevaday / alphazero-general
View on GitHub
A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.
☆90Dec 11, 2024Updated last year
mila-iqia / SGI
View on GitHub
Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)
☆56Jul 27, 2021Updated 5 years ago
facebookresearch / mbrl-lib
View on GitHub
Library for Model Based RL
☆1,065Jul 12, 2024Updated 2 years ago
michalsustr / spielviz
View on GitHub
SpielViz is an interactive viewer for OpenSpiel games.
☆37May 14, 2024Updated 2 years ago
wulfebw / muzero
View on GitHub
A python implemenation of tabular MuZero for educational purposes
☆21Dec 11, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mohmdelsayed / HesScale
View on GitHub
Scalable Computation of Hessian Diagonals
☆14Jun 2, 2024Updated 2 years ago
timoklein / alphazero-gym
View on GitHub
AlphaZero for continuous control tasks
☆23Dec 7, 2022Updated 3 years ago
evgenii-nikishin / omd
View on GitHub
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
☆43Jun 14, 2021Updated 5 years ago
enpasos / muzero
View on GitHub
☆14Jan 16, 2025Updated last year
jonathan-laurent / AlphaZero.jl
View on GitHub
A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.
☆1,330Apr 11, 2026Updated 3 months ago
google-deepmind / open_spiel
View on GitHub
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
☆5,368Jul 17, 2026Updated last week
fabricerosay / AlphaGPU
View on GitHub
Alphazero on GPU thanks to CUDA.jl
☆34Aug 30, 2021Updated 4 years ago