[IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework
☆127 · Feb 25, 2026 · Updated 2 months ago
Alternatives and similar repositories for minizero
Users who are interested in minizero are comparing it to the libraries listed below.
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆66 · Nov 14, 2024 · Updated last year
- PyTorch implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆77 · Dec 31, 2025 · Updated 4 months ago
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT… ☆1,577 · Apr 29, 2026 · Updated last week
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree ☆27 · May 2, 2025 · Updated last year
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py). ☆33 · Aug 14, 2022 · Updated 3 years ago
- A USI-compliant Tsumeshogi engine ☆21 · Feb 10, 2026 · Updated 2 months ago
- A C++ PyTorch implementation of MuZero ☆40 · May 1, 2024 · Updated 2 years ago
- fast + parallel AlphaZero in JAX ☆111 · Dec 22, 2024 · Updated last year
- PyTorch implementation of MuZero ☆353 · Jul 23, 2023 · Updated 2 years ago
- An implementation of MuZero in JAX. ☆58 · Nov 8, 2022 · Updated 3 years ago
- PyTorch implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆35 · Jun 25, 2025 · Updated 10 months ago
- MuZero ☆2,807 · Sep 3, 2024 · Updated last year
- Muesli RL algorithm implementation (PyTorch) (LunarLander-v2) ☆19 · Mar 18, 2024 · Updated 2 years ago
- ♟️ Vectorized RL game environments in JAX ☆603 · Mar 6, 2025 · Updated last year
- A simple implementation of the MuZero algorithm for the Connect 4 game ☆96 · Aug 11, 2020 · Updated 5 years ago
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game. ☆17 · Oct 15, 2024 · Updated last year
- AlphaZero in JAX ☆83 · Apr 3, 2024 · Updated 2 years ago
- ☆55 · Apr 11, 2023 · Updated 3 years ago
- fast + parallel AlphaZero in PyTorch ☆15 · Jan 21, 2024 · Updated 2 years ago
- Evaluation of TD-MPC2. ☆21 · Jan 21, 2024 · Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆168 · Mar 28, 2021 · Updated 5 years ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC ☆42 · Jul 24, 2025 · Updated 9 months ago
- Reversi solver in Rust ☆11 · Dec 28, 2025 · Updated 4 months ago
- ☆25 · Apr 16, 2024 · Updated 2 years ago
- ☆13 · Apr 25, 2024 · Updated 2 years ago
- Monte Carlo tree search in JAX ☆2,619 · Sep 2, 2025 · Updated 8 months ago
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024. ☆43 · May 8, 2024 · Updated 2 years ago
- ☆12 · Apr 10, 2017 · Updated 9 years ago
- Code for "DeepPolar codes", ICML 2024 ☆12 · May 7, 2024 · Updated 2 years ago
- mcts-simple is a Python3 library that implements Monte Carlo Tree Search and its variants to solve a host of problems, most commonly for … ☆32 · Aug 8, 2025 · Updated 9 months ago
- Efficient baselines for autocurricula in JAX. ☆212 · Aug 24, 2024 · Updated last year
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX ☆828 · Apr 13, 2026 · Updated 3 weeks ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆932 · Dec 20, 2023 · Updated 2 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆89 · Oct 15, 2023 · Updated 2 years ago
- Explorations into NEAT and some of its derivative research ☆37 · Apr 17, 2026 · Updated 3 weeks ago
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl… ☆23 · Jan 22, 2024 · Updated 2 years ago
- Hinton's Forward-Forward Algorithm for Deep Learning ☆10 · Feb 6, 2023 · Updated 3 years ago
- (3DV 2026) PyTorch implementation of "InterPose: Learning to Generate Human-Object Interactions from Large-Scale Web Videos" ☆26 · Mar 16, 2026 · Updated last month
- Car racing RL agents in actual F1 tracks ☆17 · Oct 22, 2024 · Updated last year