Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
☆78Dec 31, 2025Updated 6 months ago
Alternatives and similar repositories for Stochastic-muzero
Users that are interested in Stochastic-muzero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Jun 25, 2025Updated last year
- Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and obse…☆19Jan 24, 2023Updated 3 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.☆17Oct 15, 2024Updated last year
- A C++ pytorch implementation of MuZero☆40May 18, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A project that provides help for using DeepMind's mctx on gym-style environments.☆66Nov 14, 2024Updated last year
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆133May 9, 2026Updated last month
- Pytorch Implementation of MuZero☆356Jul 23, 2023Updated 2 years ago
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆115Aug 9, 2024Updated last year
- ♟️ Vectorized RL game environments in JAX☆621Mar 6, 2025Updated last year
- Advantage Alignment Algorithms (ICLR 2025 oral)☆20Apr 7, 2025Updated last year
- Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag…☆20Jun 24, 2026Updated last week
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆936Dec 20, 2023Updated 2 years ago
- MuZero☆2,836Sep 3, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆169Mar 28, 2021Updated 5 years ago
- Classic MCTS example with mctx☆25May 25, 2023Updated 3 years ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.☆26Jun 17, 2025Updated last year
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Jul 18, 2023Updated 2 years ago
- Code for our TMLR paper "Distributional GFlowNets with Quantile Flows".☆13Feb 14, 2024Updated 2 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆24Nov 18, 2022Updated 3 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆22Jun 19, 2024Updated 2 years ago
- A PyTorch implementation of DeepMind's MuZero agent☆37Dec 1, 2023Updated 2 years ago
- ☆55Apr 11, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Deep memory and sequence models in JAX☆32Jun 8, 2026Updated 3 weeks ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- Cost-aware Bayesian optimization via the Pandora's box Gittins index☆13Aug 8, 2025Updated 10 months ago
- ☆14Aug 18, 2023Updated 2 years ago
- ☆96Feb 16, 2026Updated 4 months ago
- Monte Carlo tree search in JAX☆2,634Jun 15, 2026Updated 2 weeks ago
- Code for the paper "Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning…☆14Nov 15, 2023Updated 2 years ago
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL☆412Mar 18, 2026Updated 3 months ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆16May 19, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Swarm learning algorithm☆11Jun 2, 2021Updated 5 years ago
- AlphaZero for continuous control tasks☆23Dec 7, 2022Updated 3 years ago
- ☆18Nov 4, 2021Updated 4 years ago
- Implementation of UltraMem, improved Product Key Memory design, from Bytedance AI labs☆28Nov 4, 2025Updated 7 months ago
- Code for the paper "Uncertainty-Driven Exploration for Generalization in Reinforcement Learning".☆27Jul 6, 2023Updated 2 years ago
- Incorporating Neuro-Inspired Adaptability for Continual Learning in Artificial Intelligence☆28Dec 12, 2023Updated 2 years ago
- Official implementation of ICML'24 paper "Offline Multi-Objective Optimization".☆25May 24, 2026Updated last month