[IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework
☆131 · May 9, 2026 · Updated last month
Alternatives and similar repositories for minizero
Users that are interested in minizero are comparing it to the libraries listed below.
- [ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm ☆27 · May 18, 2025 · Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆66 · Nov 14, 2024 · Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆78 · Dec 31, 2025 · Updated 5 months ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree ☆27 · May 2, 2025 · Updated last year
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py). ☆33 · Aug 14, 2022 · Updated 3 years ago
- A C++ pytorch implementation of MuZero ☆40 · May 18, 2026 · Updated last month
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data ☆115 · Aug 9, 2024 · Updated last year
- fast + parallel AlphaZero in JAX ☆111 · Dec 22, 2024 · Updated last year
- Pytorch Implementation of MuZero ☆355 · Jul 23, 2023 · Updated 2 years ago
- An implementation of MuZero in JAX. ☆58 · Nov 8, 2022 · Updated 3 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆35 · Jun 25, 2025 · Updated 11 months ago
- MuZero ☆2,828 · Sep 3, 2024 · Updated last year
- ♟️ Vectorized RL game environments in JAX ☆620 · Mar 6, 2025 · Updated last year
- Classic MCTS example with mctx ☆25 · May 25, 2023 · Updated 3 years ago
- A simple implementation of MuZero algorithm for connect4 game ☆96 · Aug 11, 2020 · Updated 5 years ago
- [ICLR 2026] From Observations to Events: Event-Aware World Models for Reinforcement Learning ☆46 · May 30, 2026 · Updated 2 weeks ago
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game. ☆17 · Oct 15, 2024 · Updated last year
- AlphaZero in JAX ☆82 · Apr 3, 2024 · Updated 2 years ago
- ☆55 · Apr 11, 2023 · Updated 3 years ago
- fast + parallel AlphaZero in PyTorch ☆15 · Jan 21, 2024 · Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆168 · Mar 28, 2021 · Updated 5 years ago
- ☆47 · Jan 29, 2024 · Updated 2 years ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC ☆43 · Updated this week
- ☆26 · Apr 16, 2024 · Updated 2 years ago
- ☆13 · Apr 25, 2024 · Updated 2 years ago
- A collection of notebooks aiding the understanding of machine-learning papers. ☆10 · Apr 5, 2021 · Updated 5 years ago
- Monte Carlo tree search in JAX ☆2,632 · Updated this week
- AI Shell ☆16 · Feb 1, 2024 · Updated 2 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general ☆46 · Dec 27, 2022 · Updated 3 years ago
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024. ☆45 · May 8, 2024 · Updated 2 years ago
- Code for "DeepPolar codes", ICML 2024 ☆12 · May 7, 2024 · Updated 2 years ago
- A PyTorch native library for large model training ☆30 · Apr 1, 2026 · Updated 2 months ago
- Efficient baselines for autocurricula in JAX. ☆214 · Aug 24, 2024 · Updated last year
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX ☆840 · Jun 3, 2026 · Updated 2 weeks ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆934 · Dec 20, 2023 · Updated 2 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆91 · Oct 15, 2023 · Updated 2 years ago
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl… ☆23 · Jan 22, 2024 · Updated 2 years ago
- Official repo of LookWhere (NeurIPS 2025) for efficient high-res visual recognition ☆16 · Oct 23, 2025 · Updated 7 months ago
- Hinton's Forward-Forward Algorithm for Deep Learning ☆10 · Feb 6, 2023 · Updated 3 years ago