Zeta36/muzero

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Zeta36/muzero)

Zeta36 / muzero

A simple implementation of MuZero algorithm for connect4 game

☆96

Alternatives and similar repositories for muzero

Users that are interested in muzero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

johan-gras / MuZero
View on GitHub
A structured implementation of MuZero
☆205Jun 4, 2022Updated 4 years ago
koulanurag / muzero-pytorch
View on GitHub
Pytorch Implementation of MuZero
☆356Jul 23, 2023Updated 3 years ago
YuriCat / MuZeroJupyterExample
View on GitHub
☆66Nov 3, 2021Updated 4 years ago
wulfebw / muzero
View on GitHub
A python implemenation of tabular MuZero for educational purposes
☆21Dec 11, 2019Updated 6 years ago
werner-duvaud / muzero-general
View on GitHub
MuZero
☆2,844Sep 3, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
kaesve / muzero
View on GitHub
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…
☆169Mar 28, 2021Updated 5 years ago
fidel-schaposnik / muzero
View on GitHub
Tensorflow implementation of MuZero algorithm
☆11Aug 23, 2022Updated 3 years ago
JimOhman / model-based-rl
View on GitHub
Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆33Aug 14, 2022Updated 3 years ago
hr0nix / omega
View on GitHub
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆44Sep 19, 2022Updated 3 years ago
LucasAlegre / sac-plus
View on GitHub
Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).
☆15Feb 21, 2021Updated 5 years ago
rlglab / minizero
View on GitHub
[IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework
☆137Jul 17, 2026Updated last week
epignatelli / discovering-reinforcement-learning-algorithms
View on GitHub
A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…
☆23Dec 22, 2020Updated 5 years ago
NGRThomson / ADL_RL
View on GitHub
Advanced Deep Learning and Reinforcement Learning 2018 Assignments
☆18Nov 24, 2018Updated 7 years ago
kelechi-c / dit_flow
View on GitHub
DiT (training + flow matching) in Jax
☆12Jan 5, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ituvisionlab / EdVAE
View on GitHub
Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"
☆14Sep 20, 2024Updated last year
behaviorguidedRL / BGRL
View on GitHub
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Jun 24, 2020Updated 6 years ago
sii-yingwen / rommeo
View on GitHub
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23Dec 8, 2022Updated 3 years ago
aicenter / TensorCFR
View on GitHub
☆10Feb 28, 2019Updated 7 years ago
CodeAltus / Snake-AI
View on GitHub
Using the A* pathfinding algorithm to play the classic snake game perfectly
☆17May 20, 2021Updated 5 years ago
gabrieledcjr / DeepRL
View on GitHub
☆19Mar 28, 2019Updated 7 years ago
huangeddie / GymGo
View on GitHub
An environment of the board game Go using OpenAI's Gym API
☆176May 3, 2022Updated 4 years ago
takuseno / d4rl-pybullet
View on GitHub
Datasets for data-driven deep reinforcement learning with PyBullet environments
☆152Mar 19, 2021Updated 5 years ago
seungjaeryanlee / rl-exploration
View on GitHub
Reinforcement Learning papers on exploration methods.
☆19Jun 27, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
GallagherCommaJack / modulax
View on GitHub
☆18Aug 24, 2024Updated last year
BerkeleyAutomation / Urban_Driving_Simulator
View on GitHub
FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.
☆24May 3, 2019Updated 7 years ago
wangyuhuix / TRGPPO
View on GitHub
☆34Nov 21, 2022Updated 3 years ago
instadeepai / AlphaNPI
View on GitHub
Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
☆79Oct 3, 2023Updated 2 years ago
elsheikh21 / population-based-training-of-NNs
View on GitHub
Applying PBT optimization technique to different domains
☆10Oct 16, 2019Updated 6 years ago
gaflach / usizer
View on GitHub
discrete gate sizing
☆14Nov 23, 2020Updated 5 years ago
you68681 / GPAR
View on GitHub
☆23Apr 4, 2024Updated 2 years ago
krocki / mcts_mpi
View on GitHub
GPU Monte Carlo Tree Search with MPI
☆26Jan 9, 2019Updated 7 years ago
microsoft / coax
View on GitHub
This project was moved to: https://github.com/coax-dev/coax
☆161Nov 28, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ianfab / chess-variant-puzzler
View on GitHub
Puzzle generator for chess variants
☆19Jul 1, 2026Updated 3 weeks ago
dannysdeng / dqn-pytorch
View on GitHub
PyTorch - Implicit Quantile Networks - Quantile Regression - C51
☆22Jul 26, 2019Updated 7 years ago
SimonOuellette35 / BayesianRL
View on GitHub
Some example code for the "Introduction to Bayesian Reinforcement Learning" presentations
☆29Feb 15, 2019Updated 7 years ago
dimarkov / pybefit
View on GitHub
Probabilistic inference for models of behaviour
☆13Mar 5, 2026Updated 4 months ago
pfnet-research / capg
View on GitHub
Implementation of clipped action policy gradient (CAPG) with PPO and TRPO
☆31Jun 24, 2018Updated 8 years ago
zchuning / repo
View on GitHub
Resilient Model-Based RL by Regularizing Posterior Predictability
☆22Mar 4, 2024Updated 2 years ago
RobertTLange / spinningup-workspace
View on GitHub
Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.
☆40Dec 8, 2022Updated 3 years ago