Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆33Aug 14, 2022Updated 3 years ago
Alternatives and similar repositories for model-based-rl
Users that are interested in model-based-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag…☆19Jun 30, 2021Updated 4 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Tensorflow implementation of MuZero algorithm☆11Aug 23, 2022Updated 3 years ago
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Oct 7, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆127Feb 25, 2026Updated 2 months ago
- A simple implementation of MuZero algorithm for connect4 game☆96Aug 11, 2020Updated 5 years ago
- AlphaZero for continuous control tasks☆23Dec 7, 2022Updated 3 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆27May 2, 2025Updated 11 months ago
- ☆18Aug 24, 2024Updated last year
- Turn jitted jax functions back into python source code☆23Dec 16, 2024Updated last year
- MuZero☆2,801Sep 3, 2024Updated last year
- Pytorch Implementation of MuZero☆353Jul 23, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆11May 15, 2020Updated 5 years ago
- Trade using DRL algorithms on tensorflow2 and tf-agents☆11Oct 10, 2025Updated 6 months ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆66Nov 14, 2024Updated last year
- ☆36Feb 3, 2026Updated 2 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Jun 25, 2025Updated 10 months ago
- The homework of robos learning base.☆11May 23, 2023Updated 2 years ago
- Distributed Graph Mining on a Massive "Single" Graph☆15Mar 28, 2020Updated 6 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Jul 18, 2023Updated 2 years ago
- ☆28Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆13Apr 28, 2019Updated 7 years ago
- Haskell to D3.js binding by deep EDSL approach.☆23Sep 20, 2014Updated 11 years ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 5 years ago
- Double Q-learning reinforcement learning agent on NES Super Mario Bros☆42May 4, 2019Updated 6 years ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆40Apr 14, 2026Updated 2 weeks ago
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆110Jan 23, 2022Updated 4 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆77Dec 31, 2025Updated 4 months ago
- ☆10Apr 5, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Jan 16, 2025Updated last year
- ♟️ Vectorized RL game environments in JAX☆602Mar 6, 2025Updated last year
- ☆17Jan 6, 2024Updated 2 years ago
- An assemble of various world model including dreamer v2 and v3☆10Sep 9, 2023Updated 2 years ago
- A C++ pytorch implementation of MuZero☆40May 1, 2024Updated 2 years ago
- cmdr cxx version, a C++17/20 header-only command-line parser with hierarchical config data manager here☆18Apr 23, 2026Updated last week
- An implementation of a Brownian motion using ClojureScript with re-frame and Highcharts☆11Feb 8, 2019Updated 7 years ago