PyTorch implementation of MuZero
☆353 · Jul 23, 2023 · Updated 2 years ago
Alternatives and similar repositories for muzero-pytorch
Users who are interested in muzero-pytorch are comparing it to the libraries listed below.
- MuZero · ☆2,819 · Sep 3, 2024 · Updated last year
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. · ☆933 · Dec 20, 2023 · Updated 2 years ago
- A simple implementation of the MuZero algorithm for the Connect4 game · ☆96 · Aug 11, 2020 · Updated 5 years ago
- A structured implementation of MuZero · ☆206 · Jun 4, 2022 · Updated 3 years ago
- A Python implementation of tabular MuZero for educational purposes · ☆21 · Dec 11, 2019 · Updated 6 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… · ☆168 · Mar 28, 2021 · Updated 5 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations" · ☆164 · Dec 21, 2021 · Updated 4 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control" · ☆12 · Jul 12, 2021 · Updated 4 years ago
- Docker containers of baseline agents for the Crafter environment · ☆30 · Dec 14, 2021 · Updated 4 years ago
- ☆66 · Nov 3, 2021 · Updated 4 years ago
- PyTorch implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… · ☆78 · Dec 31, 2025 · Updated 5 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… · ☆43 · Sep 19, 2022 · Updated 3 years ago
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments. · ☆1,443 · Updated this week
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery · ☆86 · Jul 27, 2022 · Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] · ☆42 · Aug 27, 2022 · Updated 3 years ago
- Library for Model Based RL · ☆1,062 · Jul 12, 2024 · Updated last year
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework · ☆129 · May 9, 2026 · Updated 3 weeks ago
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. · ☆884 · Oct 14, 2024 · Updated last year
- Repository for the paper "Planning to Explore via Self-Supervised World Models" · ☆238 · Feb 10, 2023 · Updated 3 years ago
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds. · ☆871 · Aug 12, 2024 · Updated last year
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" · ☆544 · Nov 22, 2022 · Updated 3 years ago
- ☆364 · Oct 12, 2022 · Updated 3 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py). · ☆33 · Aug 14, 2022 · Updated 3 years ago
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT… · ☆1,591 · May 12, 2026 · Updated 2 weeks ago
- Monte Carlo tree search in JAX · ☆2,626 · Sep 2, 2025 · Updated 8 months ago
- Deep Hierarchical Planning from Pixels · ☆119 · Dec 21, 2022 · Updated 3 years ago
- ☆47 · Sep 24, 2024 · Updated last year
- RAD: Reinforcement Learning with Augmented Data · ☆419 · Mar 29, 2021 · Updated 5 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO · ☆188 · Apr 12, 2022 · Updated 4 years ago
- Mastering Atari with Discrete World Models · ☆1,045 · Jan 21, 2023 · Updated 3 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games · ☆559 · Jun 26, 2023 · Updated 2 years ago
- An implementation of MuZero in JAX. · ☆58 · Nov 8, 2022 · Updated 3 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch. · ☆322 · Jan 11, 2024 · Updated 2 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning · ☆605 · Oct 28, 2020 · Updated 5 years ago
- Evaluating long-term memory of reinforcement learning algorithms · ☆179 · Jun 23, 2023 · Updated 2 years ago
- ☆101 · Feb 14, 2024 · Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination · ☆601 · Sep 10, 2021 · Updated 4 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm · ☆238 · Apr 26, 2023 · Updated 3 years ago
- Reinforcement Learning in PyTorch · ☆2,275 · Jan 4, 2021 · Updated 5 years ago