roger-creus / stable-deep-rl-at-scaleView external linksLinks
Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments!
☆35Oct 24, 2025Updated 3 months ago
Alternatives and similar repositories for stable-deep-rl-at-scale
Users that are interested in stable-deep-rl-at-scale are comparing it to the libraries listed below
Sorting:
- A framework for evaluating LLMs in Atari games☆15Apr 21, 2025Updated 9 months ago
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆13Oct 9, 2024Updated last year
- ☆119Feb 25, 2025Updated 11 months ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆91Nov 4, 2025Updated 3 months ago
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆22Jul 21, 2025Updated 6 months ago
- A scalable benchmark for state representation learning in visual reinforcement learning.☆16Jun 23, 2025Updated 7 months ago
- Code for "When Should We Prefer State-to-Visual DAgger Over Visual Reinforcement Learning?"☆14Dec 19, 2024Updated last year
- Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.☆35Feb 9, 2026Updated last week
- [ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning☆41Jun 5, 2025Updated 8 months ago
- ☆60Jan 30, 2026Updated 2 weeks ago
- ☆28Aug 19, 2024Updated last year
- ☆13May 21, 2023Updated 2 years ago
- ☆19May 20, 2025Updated 8 months ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆22Jan 14, 2025Updated last year
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆143Jun 23, 2025Updated 7 months ago
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆49Jun 27, 2024Updated last year
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆22Oct 3, 2022Updated 3 years ago
- Code for Sibling Rivalry and experiments presented in associated paper☆17May 1, 2025Updated 9 months ago
- A benchmark to test and compare your pcg algorithm against each other☆22Sep 22, 2025Updated 4 months ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆78May 27, 2024Updated last year
- Learning Robust Dynamics Through Variational Sparse Gating☆20Oct 19, 2022Updated 3 years ago
- ☆267Nov 28, 2025Updated 2 months ago
- ☆24Jan 26, 2024Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆27May 22, 2023Updated 2 years ago
- ☆23Apr 2, 2024Updated last year
- Official implementation of the BRO algorithm☆54Jan 29, 2025Updated last year
- krazy grid world☆25Mar 2, 2020Updated 5 years ago
- ☆423Oct 12, 2025Updated 4 months ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Jul 27, 2021Updated 4 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆104May 17, 2022Updated 3 years ago
- ☆39Feb 4, 2026Updated last week
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation☆76Aug 2, 2023Updated 2 years ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆197Dec 19, 2025Updated last month
- Source files to replicate experiments in my ICLR 2022 paper.☆71Jul 17, 2025Updated 6 months ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- ☆86Jan 9, 2026Updated last month
- [ICLR 2025] Learning Transformer-based World Models with Contrastive Predictive Coding (TWISTER)☆48Mar 9, 2025Updated 11 months ago
- [ICLR 2023] Choreographer: a world-model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able t…☆41Jun 18, 2024Updated last year
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆274Mar 18, 2025Updated 10 months ago