roger-creus/stable-deep-rl-at-scale

Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments!

☆39

Alternatives and similar repositories for stable-deep-rl-at-scale

Users who are interested in stable-deep-rl-at-scale compare it to the repositories listed below.

roger-creus / ale-nl
A framework for evaluating LLMs in Atari games
☆15 · Apr 21, 2025 · Updated last year
lilucse / SparseNetwork4DRL
[ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
☆41 · Jun 5, 2025 · Updated last year
RLE-Foundation / Plasticine
Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.
☆44 · Feb 9, 2026 · Updated 5 months ago
SonyResearch / simba
☆128 · Feb 25, 2025 · Updated last year
DAVIAN-Robotics / SimbaV2
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆108 · Nov 4, 2025 · Updated 8 months ago
pd-perry / TQL
☆28 · May 11, 2026 · Updated 2 months ago
naumix / BiggerRegularizedOptimistic
Official implementation of the BRO algorithm
☆61 · Jan 29, 2025 · Updated last year
Guozheng-Ma / Adaptive-Replay-Ratio
[ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.
☆13 · Oct 9, 2024 · Updated last year
seohongpark / horizon-reduction
The official implementation of "Horizon Reduction Makes RL Scalable"
☆200 · Aug 2, 2025 · Updated 11 months ago
JesseFarebro / xtils
A collection of utilities for machine learning experiments.
☆11 · Jan 8, 2026 · Updated 6 months ago
wang-kevin3290 / scaling-crl
☆307 · Updated this week
RajGhugare19 / builderbench
☆35 · Mar 26, 2026 · Updated 4 months ago
rai-opensource / q2rl
Q-Estimation and Q-Gating from BC for RL
☆44 · Jul 8, 2026 · Updated 3 weeks ago
alexanderswerdlow / faster
☆30 · Jun 30, 2026 · Updated 3 weeks ago
vivekmyers / empowerment_successor_representations
Code for the paper "Learning to Assist Humans without Inferring Rewards"
☆20 · Jul 7, 2024 · Updated 2 years ago
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆154 · Apr 7, 2026 · Updated 3 months ago
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆242 · Nov 24, 2025 · Updated 8 months ago
MichaelTMatthews / purejaxgcrl
GCRL in JAX. Official repository for LEO (ICML 2026).
☆28 · Jun 20, 2026 · Updated last month
aoberai / trl
Code for "Transitive RL: Value Learning via Divide and Conquer"
☆60 · Oct 31, 2025 · Updated 8 months ago
EmptyJackson / unifloral
Unified Implementations of Offline Reinforcement Learning Algorithms
☆225 · Dec 19, 2025 · Updated 7 months ago
Princeton-RL / CRTR
Official code for the paper "Contrastive Representations for Temporal Reasoning".
☆57 · Nov 25, 2025 · Updated 8 months ago
cvoelcker / reppo
Official Code for "Relative Entropy Pathwise Policy Optimization"
☆59 · May 6, 2026 · Updated 2 months ago
isaac7778 / FIRE
Code for the paper "FIRE: Frobenius-Isometry Reinitialization for Balancing the Stability–Plasticity Tradeoff" (ICLR 2026 Oral)
☆30 · Apr 27, 2026 · Updated 3 months ago
sail-sg / ContinualBench
☆25 · May 20, 2025 · Updated last year
MichalBortkiewicz / JaxGCRL
Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.
☆274 · Jun 6, 2026 · Updated last month
samlobel / CFN
Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023
☆25 · Dec 29, 2023 · Updated 2 years ago
OffDynamicsRL / off-dynamics-rl
☆65 · Jan 30, 2026 · Updated 5 months ago
luchris429 / model-free-opponent-shaping
Code for Model-Free Opponent Shaping (ICML 2022)
☆24 · Nov 18, 2022 · Updated 3 years ago
enjeeneer / zero-shot-rl
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
☆29 · Jan 14, 2025 · Updated last year
Howuhh / streaming-drl-jax
streaming deep reinforcement learning but 4x faster with jax!
☆19 · Jan 4, 2026 · Updated 6 months ago
nicklashansen / newt
Official code repository for the paper "Learning Massively Multitask World Models for Continuous Control".
☆128 · Jan 9, 2026 · Updated 6 months ago
aoberai / rql
Code for "Reversal Q-Learning (RQL)" for Flow RL from Prior Data
☆34 · Jun 17, 2026 · Updated last month
XuGW-Kevin / DrM
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …
☆78 · Feb 19, 2026 · Updated 5 months ago
CMU-AIRe / floq
Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL
☆46 · Apr 7, 2026 · Updated 3 months ago
HaoxiangYou / D.VA
Official Implementation of Accelerating Visual-Policy Learning through Parallel Differentiable Simulation
☆41 · Nov 10, 2025 · Updated 8 months ago
zhouzypaul / wsrl
JAX implementation of WSRL and RL baselines | ICLR 2025
☆146 · Feb 26, 2026 · Updated 5 months ago
naumix / BiggerRegularizedCategorical
☆17 · Apr 23, 2026 · Updated 3 months ago
younggyoseo / FastTD3
☆455 · May 16, 2026 · Updated 2 months ago
aalmuzairee / dmcgb2
Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)
☆22 · Jul 21, 2025 · Updated last year