Shengjiewang-Jason / EfficientZeroV2
[ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
☆75 Updated 8 months ago
Alternatives and similar repositories for EfficientZeroV2:
Users who are interested in EfficientZeroV2 are comparing it to the libraries listed below:
- ☆81 Updated 10 months ago
- Transformer-based World Models ☆78 Updated 2 years ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm. ☆83 Updated 2 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆78 Updated 4 months ago
- Source files to replicate experiments in my ICLR 2022 paper. ☆70 Updated 9 months ago
- Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning ☆90 Updated 8 months ago
- Repo for Implicit Diffusion Q-Learning ☆104 Updated last year
- ☆80 Updated last month
- JAX implementation of WSRL and RL baselines | ICLR 2025 ☆36 Updated 2 weeks ago
- Foundation Policies with Hilbert Representations (ICML 2024) ☆82 Updated 11 months ago
- Benchmarked implementations of Offline RL Algorithms. ☆72 Updated last month
- ☆270 Updated 2 years ago
- Goal-Conditioned Reinforcement Learning with JAX ☆142 Updated 2 weeks ago
- A benchmark for offline goal-conditioned RL and offline RL ☆145 Updated last week
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆110 Updated 7 months ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning" ☆41 Updated last week
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (PyTorch) ☆25 Updated 5 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆100 Updated 10 months ago
- Code for TransDreamer: Reinforcement Learning with Transformer World Models ☆25 Updated last year
- Meta-RL Model-Based Algorithm ☆31 Updated 10 months ago
- Reproduction of DreamerV1 and V2 in PyTorch for the DeepMind Control Suite ☆37 Updated 2 years ago
- [NeurIPS 2023] Implementation of Elastic Decision Transformer ☆35 Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆69 Updated 10 months ago
- Implementation of Trajectory Transformer with attention caching and batched beam search ☆110 Updated last year
- A simple and scalable agent for training adaptive policies with sequence-based RL ☆117 Updated this week
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation ☆74 Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆141 Updated last year
- Reinforcement Learning via Supervised Learning ☆71 Updated 2 years ago
- ☆260 Updated 3 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆101 Updated 2 years ago