seohongpark/horizon-reduction

seohongpark / horizon-reduction

The official implementation of "Horizon Reduction Makes RL Scalable"

☆200

Alternatives and similar repositories for horizon-reduction

Users who are interested in horizon-reduction are comparing it to the libraries listed below.

seohongpark / fql
The official implementation of flow Q-learning (FQL)
☆321 · Jul 21, 2025 · Updated last year
seohongpark / ogbench
A benchmark for offline goal-conditioned RL and offline RL
☆443 · Jan 14, 2026 · Updated 6 months ago
ColinQiyangLi / dqc
Decoupled Q-Chunking
☆74 · May 3, 2026 · Updated 2 months ago
kwanyoungpark / MAC
Code for Scalable Offline Model-Based RL with Action chunking
☆30 · Feb 20, 2026 · Updated 5 months ago
deepindermann / dual-goal-representations
The official implementation of "Dual Goal Representations"
☆39 · Oct 7, 2025 · Updated 9 months ago
ColinQiyangLi / qc
☆396 · Feb 5, 2026 · Updated 5 months ago
aoberai / trl
Code for "Transitive RL: Value Learning via Divide and Conquer"
☆60 · Oct 31, 2025 · Updated 8 months ago
zhouzypaul / wsrl
JAX implementation of WSRL and RL baselines | ICLR 2025
☆145 · Feb 26, 2026 · Updated 5 months ago
CMU-AIRe / floq
Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL
☆46 · Apr 7, 2026 · Updated 3 months ago
ColinQiyangLi / qam
Q-learning with Adjoint Matching
☆109 · May 11, 2026 · Updated 2 months ago
chongyi-zheng / value-flows
The official implementation of Value Flows
☆55 · Feb 27, 2026 · Updated 5 months ago
DAVIAN-Robotics / SimbaV2
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆108 · Nov 4, 2025 · Updated 8 months ago
kvfrans / cfgrl
☆109 · May 31, 2025 · Updated last year
kvfrans / fre
Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"
☆57 · Mar 26, 2024 · Updated 2 years ago
RajGhugare19 / builderbench
☆35 · Mar 26, 2026 · Updated 4 months ago
alexanderswerdlow / faster
☆30 · Jun 30, 2026 · Updated 3 weeks ago
roger-creus / stable-deep-rl-at-scale
Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…
☆39 · Oct 24, 2025 · Updated 9 months ago
pd-perry / TQL
☆28 · May 11, 2026 · Updated 2 months ago
nicklashansen / newt
Official code repository for the paper "Learning Massively Multitask World Models for Continuous Control".
☆129 · Jan 9, 2026 · Updated 6 months ago
WJ2003B / mqe-release
Official Release of Multistep Quasimetric Estimation (MQE)
☆18 · Mar 13, 2026 · Updated 4 months ago
younggyoseo / FastTD3
☆455 · May 16, 2026 · Updated 2 months ago
MaxSobolMark / PolicyAgnosticRL
☆92 · Aug 4, 2025 · Updated 11 months ago
SonyResearch / simba
☆128 · Feb 25, 2025 · Updated last year
pd-perry / EXPO
☆34 · Aug 25, 2025 · Updated 11 months ago
seohongpark / METRA
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆92 · Oct 15, 2023 · Updated 2 years ago
aoberai / rql
Code for "Reversal Q-Learning (RQL)" for Flow RL from Prior Data
☆33 · Jun 17, 2026 · Updated last month
Viraj-Joshi / MTBench
☆45 · Jul 1, 2026 · Updated 3 weeks ago
EmptyJackson / unifloral
Unified Implementations of Offline Reinforcement Learning Algorithms
☆225 · Dec 19, 2025 · Updated 7 months ago
seohongpark / HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆98 · Dec 1, 2024 · Updated last year
kwanyoungpark / LEQ
Code for Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning
☆19 · Feb 6, 2025 · Updated last year
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆154 · Apr 7, 2026 · Updated 3 months ago
MichalBortkiewicz / JaxGCRL
Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.
☆274 · Jun 6, 2026 · Updated last month
sukhijab / maxinforl_jax
☆29 · Jan 8, 2026 · Updated 6 months ago
lilucse / SparseNetwork4DRL
[ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
☆41 · Jun 5, 2025 · Updated last year
nakamotoo / dsrl_pi0
Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)
☆281 · Apr 27, 2026 · Updated 3 months ago
ankile / robust-rearrangement
From Imitation to Refinement -- Residual RL for Precise Assembly
☆246 · Dec 2, 2025 · Updated 7 months ago
nissymori / JAX-CORL
Clean single-file implementation of offline RL algorithms in JAX
☆182 · Jun 5, 2026 · Updated last month
csmile-1006 / DEAS-Isaac-GR00T
DEAS + Isaac-GR00T + RoboCasa
☆20 · Nov 22, 2025 · Updated 8 months ago
wang-kevin3290 / scaling-crl
☆307 · Updated this week