A list of Offline to Online RL papers (continually updated)
☆94Apr 25, 2026Updated last month
Alternatives and similar repositories for awesome-offline-to-online-RL-papers
Users that are interested in awesome-offline-to-online-RL-papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Apr 16, 2024Updated 2 years ago
- [KDD 2023] Causal Inference via Style Transfer for Out-of-distribution Generalisation☆28Feb 29, 2024Updated 2 years ago
- Causal Discovery via Bayesian Optimization (DrBO) - ICLR 2025☆24Apr 13, 2025Updated last year
- [WACV 2024] Domain Generalisation via Risk Distribution Matching☆23Sep 19, 2024Updated last year
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆121Jul 31, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239☆22Jun 24, 2023Updated 2 years ago
- [CVPR 2025] h-Edit: Effective and Flexible Diffusion-Based Editing via Doob’s h-Transform☆77Jun 11, 2025Updated 11 months ago
- ☆22May 27, 2024Updated 2 years ago
- ☆64Jan 30, 2026Updated 4 months ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆63Aug 3, 2023Updated 2 years ago
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆32Nov 12, 2024Updated last year
- Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]☆25Nov 9, 2024Updated last year
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 5 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆29Feb 21, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆13Oct 9, 2024Updated last year
- Advantage weighted Actor Critic for Offline RL☆53Aug 27, 2022Updated 3 years ago
- Synthetic Experience Replay☆112Apr 16, 2026Updated last month
- [ICML 2024] The offical implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage…☆34May 31, 2024Updated 2 years ago
- Official implementation of Bidirectional Diffusion Bridge Models☆24May 26, 2025Updated last year
- code for paper "Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning"☆21Feb 24, 2024Updated 2 years ago
- Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…☆39Oct 24, 2025Updated 7 months ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆46Jul 27, 2023Updated 2 years ago
- solving ml10☆26Nov 10, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- The official PyTorch implementation of the paper "Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regul…☆15Nov 10, 2024Updated last year
- Official repo for arxiv paper "Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion I…☆17Nov 8, 2024Updated last year
- Latent Dynamics Mixture, NeurIPS 2021☆18Oct 25, 2022Updated 3 years ago
- High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…☆1,360Aug 3, 2023Updated 2 years ago
- Dynamic Simulation Environments for Reinforcement Learning☆13Apr 17, 2021Updated 5 years ago
- Benchmarked implementations of Offline RL Algorithms.☆77Mar 4, 2025Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆30Apr 8, 2026Updated 2 months ago
- ☆64Nov 15, 2024Updated last year
- Official repo for Offline RL for Online RL☆18Oct 14, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆38Dec 30, 2024Updated last year
- An index of algorithms for offline reinforcement learning (offline-rl)☆1,063May 23, 2024Updated 2 years ago
- ☆127Feb 25, 2025Updated last year
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- [AAAI'2024] Critic-Guided Decision Transformer for Offline Reinforcement Learning☆18May 21, 2025Updated last year
- A curated list of Diffusion Model in RL resources (continually updated)☆1,610May 30, 2026Updated last week
- An elegant PyTorch offline reinforcement learning library for researchers.☆391May 2, 2026Updated last month