Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆33Aug 14, 2022Updated 3 years ago
Alternatives and similar repositories for model-based-rl
Users that are interested in model-based-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and obse…☆19Jan 24, 2023Updated 3 years ago
- Tensorflow implementation of MuZero algorithm☆11Aug 23, 2022Updated 3 years ago
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"☆14Sep 20, 2024Updated last year
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆129May 9, 2026Updated last week
- A simple implementation of MuZero algorithm for connect4 game☆96Aug 11, 2020Updated 5 years ago
- Classic MCTS example with mctx☆25May 25, 2023Updated 2 years ago
- Applying DeepMind's MuZero algorithm to the cart pole environment in gym☆22May 6, 2023Updated 3 years ago
- ☆18Aug 24, 2024Updated last year
- Pytorch Implementation of MuZero☆353Jul 23, 2023Updated 2 years ago
- Resilient Model-Based RL by Regularizing Posterior Predictability☆22Mar 4, 2024Updated 2 years ago
- Trade using DRL algorithms on tensorflow2 and tf-agents☆11Oct 10, 2025Updated 7 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A project that provides help for using DeepMind's mctx on gym-style environments.☆66Nov 14, 2024Updated last year
- ☆39Feb 3, 2026Updated 3 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Jun 25, 2025Updated 10 months ago
- Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning☆50Mar 16, 2026Updated 2 months ago
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆33Oct 16, 2023Updated 2 years ago
- The homework of robos learning base.☆11May 23, 2023Updated 2 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Jul 18, 2023Updated 2 years ago
- ☆13Apr 28, 2019Updated 7 years ago
- ☆30Apr 30, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Unlock smooth and continuous data generation for robotics with Flow Matching! Transform simple noise into precise, fluid robot actions an…☆23Jan 17, 2025Updated last year
- Double Q-learning reinforcement learning agent on NES Super Mario Bros☆43May 4, 2019Updated 7 years ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆40Apr 14, 2026Updated last month
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆110Jan 23, 2022Updated 4 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆78Dec 31, 2025Updated 4 months ago
- ☆13Apr 25, 2024Updated 2 years ago
- ♟️ Vectorized RL game environments in JAX☆607Mar 6, 2025Updated last year
- Model-based Hindsight Experience Replay☆10Jun 8, 2022Updated 3 years ago
- An assemble of various world model including dreamer v2 and v3☆10Sep 9, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A C++ pytorch implementation of MuZero☆40Updated this week
- cmdr cxx version, a C++17/20 header-only command-line parser with hierarchical config data manager here☆18Apr 23, 2026Updated 3 weeks ago
- ☆35Dec 5, 2022Updated 3 years ago
- DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation (NeurIPS 2024)☆44Feb 18, 2025Updated last year
- Repository for IROS 2019☆28Nov 16, 2019Updated 6 years ago
- Applying PBT optimization technique to different domains☆10Oct 16, 2019Updated 6 years ago
- Official codebase for Adaptive Online Planning for Continual Lifelong Learning.☆17Mar 26, 2020Updated 6 years ago