Pytorch Implementation of MuZero
☆352Jul 23, 2023Updated 2 years ago
Alternatives and similar repositories for muzero-pytorch
Users that are interested in muzero-pytorch are comparing it to the libraries listed below.
- MuZero☆2,791Sep 3, 2024Updated last year
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆928Dec 20, 2023Updated 2 years ago
- A simple implementation of MuZero algorithm for connect4 game☆96Aug 11, 2020Updated 5 years ago
- A structured implementation of MuZero☆206Jun 4, 2022Updated 3 years ago
- A python implementation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Mar 28, 2021Updated 5 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆164Dec 21, 2021Updated 4 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- ☆66Nov 3, 2021Updated 4 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆77Dec 31, 2025Updated 2 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.☆1,297Updated this week
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆85Jul 27, 2022Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆42Aug 27, 2022Updated 3 years ago
- Library for Model Based RL☆1,059Jul 12, 2024Updated last year
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆125Feb 25, 2026Updated last month
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆872Oct 14, 2024Updated last year
- Repository for the paper "Planning to Explore via Self-Supervised World Models"☆235Feb 10, 2023Updated 3 years ago
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.☆868Aug 12, 2024Updated last year
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆537Nov 22, 2022Updated 3 years ago
- ☆361Oct 12, 2022Updated 3 years ago
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,555Updated this week
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- Monte Carlo tree search in JAX☆2,602Sep 2, 2025Updated 6 months ago
- Deep Hierarchical Planning from Pixels☆118Dec 21, 2022Updated 3 years ago
- ☆47Sep 24, 2024Updated last year
- RAD: Reinforcement Learning with Augmented Data☆417Mar 29, 2021Updated 5 years ago
- Mastering Atari with Discrete World Models☆1,015Jan 21, 2023Updated 3 years ago
- A pytorch replication of the model-based reinforcement learning algorithm MBPO☆187Apr 12, 2022Updated 3 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆559Jun 26, 2023Updated 2 years ago
- An implementation of MuZero in JAX.☆58Nov 8, 2022Updated 3 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆323Jan 11, 2024Updated 2 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆600Oct 28, 2020Updated 5 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆167Jun 23, 2023Updated 2 years ago