[ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
☆115Aug 9, 2024Updated last year
Alternatives and similar repositories for EfficientZeroV2
Users that are interested in EfficientZeroV2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm☆27May 18, 2025Updated last year
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆934Dec 20, 2023Updated 2 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Jul 18, 2023Updated 2 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆78Dec 31, 2025Updated 5 months ago
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,607Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.☆17Oct 15, 2024Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Jun 25, 2025Updated 11 months ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆166Dec 21, 2021Updated 4 years ago
- Advantage Alignment Algorithms (ICLR 2025 oral)☆20Apr 7, 2025Updated last year
- Implementation of Dreamer v3 in pytorch.☆858Mar 8, 2026Updated 3 months ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 3 years ago
- Open-source codebase for PaMoRL, from "Parallelizing Model-based Reinforcement Learning Over the Sequence Length" at NeurIPS 2024.☆14Dec 17, 2024Updated last year
- ☆138Mar 18, 2026Updated 2 months ago
- [ICLR 2026] From Observations to Events: Event-Aware World Models for Reinforcement Learning☆45May 30, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Pytorch Implementation of MuZero☆355Jul 23, 2023Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Mar 28, 2021Updated 5 years ago
- ☆127Feb 25, 2025Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Scalable metrics logging and analysis☆18Updated this week
- ☆23Apr 2, 2024Updated 2 years ago
- Pytorch Implementation of Learning Latent Dynamic Robust Representations for World Models☆25May 11, 2024Updated 2 years ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆153Apr 7, 2026Updated 2 months ago
- Transformer-based World Models☆90Apr 4, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A curated list of awesome model based RL resources (continually updated)☆1,364May 21, 2026Updated 3 weeks ago
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆862May 21, 2025Updated last year
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Jul 27, 2021Updated 4 years ago
- Flax (Jax) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face.☆26Feb 20, 2025Updated last year
- Reinforcement learning library written in Rust☆24Aug 3, 2017Updated 8 years ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆126Sep 22, 2024Updated last year
- ☆43Apr 19, 2026Updated last month
- An implementation of MuZero in JAX.☆58Nov 8, 2022Updated 3 years ago
- Single player Alpha Zero implementation☆42Mar 7, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆96Jun 4, 2024Updated 2 years ago
- PyTorch implementation of DreamerV3, Mastering Diverse Domains through World Models.☆10Feb 16, 2024Updated 2 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆63Apr 4, 2023Updated 3 years ago
- ☆401Feb 13, 2023Updated 3 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- An implementation of Deepmind's MuZero algorithm.☆16Aug 23, 2021Updated 4 years ago
- Resilient Model-Based RL by Regularizing Posterior Predictability☆22Mar 4, 2024Updated 2 years ago