Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
☆926Dec 20, 2023Updated 2 years ago
Alternatives and similar repositories for EfficientZero
Users that are interested in EfficientZero are comparing it to the libraries listed below
Sorting:
- MuZero☆2,785Sep 3, 2024Updated last year
- Pytorch Implementation of MuZero☆352Jul 23, 2023Updated 2 years ago
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆871Oct 14, 2024Updated last year
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.☆1,279Aug 12, 2024Updated last year
- ☆361Oct 12, 2022Updated 3 years ago
- Monte Carlo tree search in JAX☆2,600Sep 2, 2025Updated 6 months ago
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.☆867Aug 12, 2024Updated last year
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,543Mar 11, 2026Updated last week
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆164Dec 21, 2021Updated 4 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Mar 28, 2021Updated 4 years ago
- An implementation of MuZero in JAX.☆57Nov 8, 2022Updated 3 years ago
- Mastering Atari with Discrete World Models☆1,014Jan 21, 2023Updated 3 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆599Oct 28, 2020Updated 5 years ago
- Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.☆2,777Apr 29, 2024Updated last year
- Library for Model Based RL☆1,058Jul 12, 2024Updated last year
- Mastering Diverse Domains through World Models☆2,928Sep 23, 2025Updated 5 months ago
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆106Aug 9, 2024Updated last year
- DrQ-v2: Improved Data-Augmented Reinforcement Learning☆431May 31, 2022Updated 3 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆88Jan 24, 2024Updated 2 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆237Apr 26, 2023Updated 2 years ago
- RL Environments in JAX 🌍☆871May 30, 2025Updated 9 months ago
- Benchmarking the Spectrum of Agent Capabilities☆528Jan 23, 2024Updated 2 years ago
- Implementation of Dreamer v3 in pytorch.☆820Mar 8, 2026Updated 2 weeks ago
- ☆330Dec 19, 2024Updated last year
- Evaluating long-term memory of reinforcement learning algorithms☆165Jun 23, 2023Updated 2 years ago
- A collection of reference environments for offline reinforcement learning☆1,662Nov 18, 2024Updated last year
- Code for the paper "Phasic Policy Gradient"☆268Apr 2, 2023Updated 2 years ago
- ☆1,411Mar 2, 2026Updated 2 weeks ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆112May 27, 2024Updated last year
- High throughput synchronous and asynchronous reinforcement learning☆976Jan 29, 2026Updated last month
- A project that provides help for using DeepMind's mctx on gym-style environments.☆65Nov 14, 2024Updated last year
- bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent☆1,531Apr 13, 2024Updated last year
- High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, T…☆9,327Jul 8, 2025Updated 8 months ago
- Simple and easily configurable grid world environments for reinforcement learning☆2,412Mar 2, 2026Updated 2 weeks ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆77Dec 31, 2025Updated 2 months ago
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.☆753Oct 26, 2022Updated 3 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆84Jul 27, 2022Updated 3 years ago