The official implementation of "Horizon Reduction Makes RL Scalable"
☆197Aug 2, 2025Updated 10 months ago
Alternatives and similar repositories for horizon-reduction
Users that are interested in horizon-reduction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of flow Q-learning (FQL)☆314Jul 21, 2025Updated 10 months ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆104Nov 4, 2025Updated 7 months ago
- Code for Scalable Offline Model-Based RL with Action chunking☆29Feb 20, 2026Updated 3 months ago
- A benchmark for offline goal-conditioned RL and offline RL☆416Jan 14, 2026Updated 5 months ago
- ☆383Feb 5, 2026Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆101May 31, 2025Updated last year
- JAX implementation of WSRL and RL baselines | ICLR 2025☆142Feb 26, 2026Updated 3 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆91Oct 15, 2023Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆56Mar 26, 2024Updated 2 years ago
- ☆91Aug 4, 2025Updated 10 months ago
- Official implementation of the BRO algorithm☆61Jan 29, 2025Updated last year
- ☆127Feb 25, 2025Updated last year
- Repo for Implicit Diffusion Q-Learning☆125Dec 5, 2023Updated 2 years ago
- Q-learning with Adjoint Matching☆93May 11, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆447May 16, 2026Updated last month
- Clean single-file implementation of offline RL algorithms in JAX☆180Jun 5, 2026Updated 2 weeks ago
- ☆64Jan 30, 2026Updated 4 months ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆270Jun 6, 2026Updated last week
- The official implementation of Value Flows☆54Feb 27, 2026Updated 3 months ago
- The official implementations of Intention-conditioned Flow Occupancy Models (InFOM)☆36May 11, 2026Updated last month
- Official Code for "Relative Entropy Pathwise Policy Optimization"☆55May 6, 2026Updated last month
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆17Nov 24, 2025Updated 6 months ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Coarse-to-fine Q-Network☆59Aug 6, 2024Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆97Dec 1, 2024Updated last year
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆258May 21, 2026Updated 3 weeks ago
- ☆21Feb 6, 2025Updated last year
- ☆98Jan 21, 2026Updated 4 months ago
- From Imitation to Refinement -- Residual RL for Precise Assembly☆240Dec 2, 2025Updated 6 months ago
- ☆32Jun 21, 2024Updated last year
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆862May 21, 2025Updated last year
- BeCL: Behavior Contrastive Learning for Unsupervised Skill Discovery.☆23May 11, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- Official repo for arxiv paper "Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion I…☆17Nov 8, 2024Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning☆80Aug 18, 2024Updated last year
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆32Nov 12, 2024Updated last year
- ☆11Nov 1, 2022Updated 3 years ago
- Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization (CoRL 2021)☆37May 3, 2022Updated 4 years ago
- An open-source library for GPU-accelerated robot learning and sim-to-real transfer.☆2,006Updated this week