Shengjiewang-Jason/EfficientZeroV2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Shengjiewang-Jason/EfficientZeroV2)

Shengjiewang-Jason / EfficientZeroV2

[ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data

☆120

Alternatives and similar repositories for EfficientZeroV2

Users that are interested in EfficientZeroV2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rlglab / optionzero
View on GitHub
[ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm
☆28May 18, 2025Updated last year
YeWR / EfficientZero
View on GitHub
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
☆939Dec 20, 2023Updated 2 years ago
weipu-zhang / STORM
View on GitHub
☆142Mar 18, 2026Updated 4 months ago
DHDev0 / Muzero-unplugged
View on GitHub
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆36Jun 25, 2025Updated last year
rlglab / minizero
View on GitHub
[IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework
☆136Jul 17, 2026Updated last week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
burchim / TWISTER
View on GitHub
[ICLR 2025] Learning Transformer-based World Models with Contrastive Predictive Coding (TWISTER)
☆57Mar 9, 2025Updated last year
DAVIAN-Robotics / SimbaV2
View on GitHub
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆108Nov 4, 2025Updated 8 months ago
SonyResearch / simba
View on GitHub
☆128Feb 25, 2025Updated last year
opendilab / LightZero
View on GitHub
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…
☆1,625Jul 17, 2026Updated last week
sail-sg / rosmo
View on GitHub
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
☆30Jul 18, 2023Updated 3 years ago
mila-iqia / spr
View on GitHub
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
☆167Dec 21, 2021Updated 4 years ago
DHDev0 / Stochastic-muzero
View on GitHub
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…
☆79Dec 31, 2025Updated 6 months ago
jrobine / twm
View on GitHub
Transformer-based World Models
☆90Apr 4, 2023Updated 3 years ago
naumix / BiggerRegularizedCategorical
View on GitHub
☆17Apr 23, 2026Updated 3 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
werner-duvaud / muzero-general
View on GitHub
MuZero
☆2,845Sep 3, 2024Updated last year
Hwhitetooth / jax_muzero
View on GitHub
An implementation of MuZero in JAX.
☆58Nov 8, 2022Updated 3 years ago
NM512 / dreamerv3-torch
View on GitHub
Implementation of Dreamer v3 in pytorch.
☆885Mar 8, 2026Updated 4 months ago
jianzhnie / RLZero
View on GitHub
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
☆17Oct 15, 2024Updated last year
nicklashansen / tdmpc2
View on GitHub
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
☆900Jul 13, 2026Updated last week
naumix / BiggerRegularizedOptimistic
View on GitHub
Official implementation of the BRO algorithm
☆61Jan 29, 2025Updated last year
tinker495 / jax-baseline
View on GitHub
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…
☆67Updated this week
facebookresearch / MRQ
View on GitHub
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆154Apr 7, 2026Updated 3 months ago
koulanurag / muzero-pytorch
View on GitHub
Pytorch Implementation of MuZero
☆356Jul 23, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
danijar / dreamerv3
View on GitHub
Mastering Diverse Domains through World Models
☆3,595May 25, 2026Updated 2 months ago
realwenlongwang / Drama
View on GitHub
[ICLR 2025] Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficien. The frist Mamba/Mamba2 MBRL agent.
☆36May 24, 2026Updated 2 months ago
pd-perry / TQL
View on GitHub
☆28May 11, 2026Updated 2 months ago
kaesve / muzero
View on GitHub
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…
☆169Mar 28, 2021Updated 5 years ago
bwfbowen / muax
View on GitHub
A project that provides help for using DeepMind's mctx on gym-style environments.
☆66Nov 14, 2024Updated last year
maximilianigl / rl-iter
View on GitHub
Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
☆11Jun 8, 2020Updated 6 years ago
opendilab / awesome-model-based-RL
View on GitHub
A curated list of awesome model based RL resources (continually updated)
☆1,385May 21, 2026Updated 2 months ago
mila-iqia / SGI
View on GitHub
Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)
☆56Jul 27, 2021Updated 4 years ago
mttga / purejaxql
View on GitHub
Simple single-file baselines for Q-Learning in pure-GPU setting
☆242Nov 24, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mlpc-ucsd / XTRA
View on GitHub
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
☆16Apr 30, 2023Updated 3 years ago
Wongziseoi / PaMoRL
View on GitHub
Open-source codebase for PaMoRL, from "Parallelizing Model-based Reinforcement Learning Over the Sequence Length" at NeurIPS 2024.
☆14Dec 17, 2024Updated last year
YeWR / RLFP
View on GitHub
RLFP (CoRL 2024)
☆14Oct 11, 2024Updated last year
vmicheli / delta-iris
View on GitHub
Efficient World Models with Context-Aware Tokenization. ICML 2024
☆129Sep 22, 2024Updated last year
dmksjfl / PAR
View on GitHub
Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)
☆15Aug 15, 2025Updated 11 months ago
facebookresearch / drqv2
View on GitHub
DrQ-v2: Improved Data-Augmented Reinforcement Learning
☆438May 31, 2022Updated 4 years ago
hr0nix / omega
View on GitHub
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆44Sep 19, 2022Updated 3 years ago