rlglab / minizeroView external linksLinks
[IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework
☆121Updated this week
Alternatives and similar repositories for minizero
Users that are interested in minizero are comparing it to the libraries listed below
Sorting:
- [ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm☆22May 18, 2025Updated 8 months ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆64Nov 14, 2024Updated last year
- A C++ pytorch implementation of MuZero☆40May 1, 2024Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Jun 25, 2025Updated 7 months ago
- An implementation of MuZero in JAX.☆57Nov 8, 2022Updated 3 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆75Dec 31, 2025Updated last month
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆101Aug 9, 2024Updated last year
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,530Updated this week
- fast + parallel AlphaZero in JAX☆109Dec 22, 2024Updated last year
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆26May 2, 2025Updated 9 months ago
- Pytorch Implementation of MuZero☆352Jul 23, 2023Updated 2 years ago
- MuZero☆2,766Sep 3, 2024Updated last year
- Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)☆19Mar 18, 2024Updated last year
- ☆46Jan 29, 2024Updated 2 years ago
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.☆17Oct 15, 2024Updated last year
- ☆53Apr 11, 2023Updated 2 years ago
- AlphaZero in JAX☆81Apr 3, 2024Updated last year
- ♟️ Vectorized RL game environments in JAX☆585Mar 6, 2025Updated 11 months ago
- Classic MCTS example with mctx☆24May 25, 2023Updated 2 years ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Mar 28, 2021Updated 4 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆46Dec 27, 2022Updated 3 years ago
- ☆24Apr 16, 2024Updated last year
- paper on dexpilot☆15Oct 14, 2019Updated 6 years ago
- Code for "DeepPolar codes", ICML 2024☆12May 7, 2024Updated last year
- mcts-simple is a Python3 library that implements Monte Carlo Tree Search and its variants to solve a host of problems, most commonly for …☆31Aug 8, 2025Updated 6 months ago
- Evaluation of TD-MPC2.☆21Jan 21, 2024Updated 2 years ago
- A simple implementation of MuZero algorithm for connect4 game☆96Aug 11, 2020Updated 5 years ago
- ☆13Apr 25, 2024Updated last year
- Hinton's Forward-Forward Algorithm for Deep Learning☆10Feb 6, 2023Updated 3 years ago
- Unity로 멀티 에이전트 강화학습(MARL) 수행하기 위한 프레임 워크 제공☆24Apr 17, 2022Updated 3 years ago
- ☆16Feb 1, 2022Updated 4 years ago
- Monte Carlo tree search in JAX☆2,589Sep 2, 2025Updated 5 months ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆41Jul 24, 2025Updated 6 months ago
- A modified benchmark for designing and controlling 2D Voxel-based Soft Robots☆38Nov 18, 2023Updated 2 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆923Dec 20, 2023Updated 2 years ago
- Efficient baselines for autocurricula in JAX.☆206Aug 24, 2024Updated last year
- [SIGMETRICS 2022] One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search☆13Nov 3, 2021Updated 4 years ago