linhlpv/awesome-offline-to-online-RL-papers

linhlpv / awesome-offline-to-online-RL-papers

A list of Offline to Online RL papers (continually updated)

☆102

Alternatives and similar repositories for awesome-offline-to-online-RL-papers

Users that are interested in awesome-offline-to-online-RL-papers are comparing it to the libraries listed below.

nktoan / Causal-Inference-via-Style-Transfer-for-OOD-Generalisation
[KDD 2023] Causal Inference via Style Transfer for Out-of-distribution Generalisation
☆29 · Feb 29, 2024 · Updated 2 years ago
nakamotoo / Cal-QL
official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)
☆123 · Jul 31, 2024 · Updated last year
phantrdat / neuralbo
☆13 · Apr 16, 2024 · Updated 2 years ago
nktoan / CRR-CausalRelationalReplay
This repository hosts the codebase corresponding to our paper, published at Expert Systems With Applications, titled 'Class-Incremental L…
☆14 · Jun 11, 2024 · Updated 2 years ago
LAMDA-RL / PRDC
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18 · Nov 8, 2024 · Updated last year
aoberai / rql
Code for "Reversal Q-Learning (RQL)" for Flow RL from Prior Data
☆32 · Jun 17, 2026 · Updated last month
thuml / SPOT
Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239
☆22 · Jun 24, 2023 · Updated 3 years ago
OffDynamicsRL / off-dynamics-rl
☆65 · Jan 30, 2026 · Updated 5 months ago
seohongpark / fql
The official implementation of flow Q-learning (FQL)
☆320 · Jul 21, 2025 · Updated last year
pcchenxi / LAPO-offlienRL
☆16 · Apr 14, 2026 · Updated 3 months ago
Roythuly / OBAC
☆22 · May 27, 2024 · Updated 2 years ago
hari-sikchi / AWAC
Advantage weighted Actor Critic for Offline RL
☆53 · Aug 27, 2022 · Updated 3 years ago
ColinQiyangLi / dqc
Decoupled Q-Chunking
☆73 · May 3, 2026 · Updated 2 months ago
ruoqizzz / entropy-offlineRL
code for paper "Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning"
☆21 · Feb 24, 2024 · Updated 2 years ago
nktoan / risk-distribution-matching
[WACV 2024] Domain Generalisation via Risk Distribution Matching
☆23 · Sep 19, 2024 · Updated last year
ryanxhr / IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆46 · Jul 27, 2023 · Updated 2 years ago
nissymori / remax-rl
[ICML2026] Official JAX code for Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
☆15 · Jul 3, 2026 · Updated 2 weeks ago
baosws / DrBO
Causal Discovery via Bayesian Optimization (DrBO) - ICLR 2025
☆24 · Apr 13, 2025 · Updated last year
nktoan / h-edit
[CVPR 2025] h-Edit: Effective and Flexible Diffusion-Based Editing via Doob’s h-Transform
☆77 · Jun 11, 2025 · Updated last year
chongyi-zheng / value-flows
The official implementation of Value Flows
☆55 · Feb 27, 2026 · Updated 4 months ago
tinkoff-ai / ReBRAC
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆63 · Aug 3, 2023 · Updated 2 years ago
ltlhuuu / A2PR
[ICML 2024] The official implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage…
☆34 · May 31, 2024 · Updated 2 years ago
ikostrikov / rlpd
☆409 · Feb 13, 2023 · Updated 3 years ago
tinkoff-ai / CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆1,367 · Aug 3, 2023 · Updated 2 years ago
nguyennm1024 / OSCaR
🔥🔥🔥 Object State Description & Change Detection
☆10 · Apr 6, 2026 · Updated 3 months ago
seohongpark / ogbench
A benchmark for offline goal-conditioned RL and offline RL
☆438 · Jan 14, 2026 · Updated 6 months ago
zhouzypaul / wsrl
JAX implementation of WSRL and RL baselines | ICLR 2025
☆145 · Feb 26, 2026 · Updated 4 months ago
polixir / OfflineRL
A collection of offline reinforcement learning algorithms.
☆211 · Nov 26, 2024 · Updated last year
hanjuku-kaso / awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
☆1,073 · May 23, 2024 · Updated 2 years ago
zaiyan-x / RFQI
Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]
☆25 · Nov 9, 2024 · Updated last year
RLE-Foundation / Plasticine
Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.
☆44 · Feb 9, 2026 · Updated 5 months ago
opendilab / awesome-diffusion-model-in-rl
A curated list of Diffusion Model in RL resources (continually updated)
☆1,630 · May 30, 2026 · Updated last month
ColinQiyangLi / qc
☆392 · Feb 5, 2026 · Updated 5 months ago
LAMDA-RL / OfflineRL-Lib
Benchmarked implementations of Offline RL Algorithms.
☆77 · Mar 4, 2025 · Updated last year
CMU-AIRe / floq
Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL
☆46 · Apr 7, 2026 · Updated 3 months ago
yihaosun1124 / OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
☆393 · May 2, 2026 · Updated 2 months ago
shidilrzf / Anti-exploration-RL
Anti exploration in offline reinforcement learning
☆11 · May 17, 2021 · Updated 5 years ago
nissymori / JAX-CORL
Clean single-file implementation of offline RL algorithms in JAX
☆182 · Jun 5, 2026 · Updated last month
Baichenjia / PBRL
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆29 · Feb 21, 2022 · Updated 4 years ago