roger-creus / stable-deep-rl-at-scale
Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning".
☆33 Updated 2 months ago
Alternatives and similar repositories for stable-deep-rl-at-scale
Users interested in stable-deep-rl-at-scale are comparing it to the repositories listed below.
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View. ☆23 Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆92 Updated last year
- ☆52 Updated 2 years ago
- ☆114 Updated 10 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆112 Updated last year
- ☆43 Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆120 Updated 3 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆82 Updated last year
- Skeleton for scalable and flexible Jax RL implementations ☆92 Updated 2 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆81 Updated 2 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023 ☆22 Updated last year
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning" ☆82 Updated last month
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks. ☆81 Updated last year
- Foundation Policies with Hilbert Representations (ICML 2024) ☆104 Updated 2 months ago
- ☆23 Updated 3 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆57 Updated last year
- ☆30 Updated last year
- Reinforcement Learning via Supervised Learning ☆72 Updated 3 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024) ☆21 Updated 11 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier" ☆27 Updated 2 years ago
- Transformer-based World Models ☆86 Updated 2 years ago
- Deep Hierarchical Planning from Pixels ☆112 Updated 3 years ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and … ☆40 Updated last year
- ☆60 Updated 2 weeks ago
- ☆52 Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper. ☆70 Updated 5 months ago
- Conservative Q learning in Jax ☆56 Updated 2 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)