nakamotoo/Cal-QL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nakamotoo/Cal-QL)

nakamotoo / Cal-QL

official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)

☆123

Alternatives and similar repositories for Cal-QL

Users that are interested in Cal-QL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ikostrikov / rlpd
View on GitHub
☆409Feb 13, 2023Updated 3 years ago
young-geng / JaxCQL
View on GitHub
Conservative Q learning in Jax
☆58Feb 7, 2023Updated 3 years ago
zhouzypaul / wsrl
View on GitHub
JAX implementation of WSRL and RL baselines | ICLR 2025
☆145Feb 26, 2026Updated 4 months ago
Haichao-Zhang / PEX
View on GitHub
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)
☆64Apr 4, 2023Updated 3 years ago
SonyResearch / simba
View on GitHub
☆128Feb 25, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
shlee94 / Off2OnRL
View on GitHub
☆61Feb 3, 2023Updated 3 years ago
linhlpv / awesome-offline-to-online-RL-papers
View on GitHub
A list of Offline to Online RL papers (continually updated)
☆103Updated this week
MaxSobolMark / PolicyAgnosticRL
View on GitHub
☆92Aug 4, 2025Updated 11 months ago
OffDynamicsRL / off-dynamics-rl
View on GitHub
☆65Jan 30, 2026Updated 5 months ago
tinkoff-ai / katakomba
View on GitHub
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆79Jun 23, 2023Updated 3 years ago
DAVIAN-Robotics / SimbaV2
View on GitHub
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆108Nov 4, 2025Updated 8 months ago
huxiao09 / QPA
View on GitHub
☆13Sep 24, 2024Updated last year
ikostrikov / implicit_q_learning
View on GitHub
☆330Jan 23, 2022Updated 4 years ago
hari-sikchi / AWAC
View on GitHub
Advantage weighted Actor Critic for Offline RL
☆53Aug 27, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
young-geng / CQL
View on GitHub
Conservative Q Learning on top of SAC
☆141Oct 15, 2022Updated 3 years ago
tinkoff-ai / CORL
View on GitHub
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆1,368Aug 3, 2023Updated 2 years ago
alexanderswerdlow / faster
View on GitHub
☆29Jun 30, 2026Updated 3 weeks ago
dibyaghosh / icvf_release
View on GitHub
Public code for "Reinforcement Learning from Passive Data via Latent Intentions"
☆89Nov 19, 2023Updated 2 years ago
MaxSobolMark / OOO
View on GitHub
Official repo for Offline RL for Online RL
☆18Oct 14, 2023Updated 2 years ago
CMU-AIRe / floq
View on GitHub
Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL
☆46Apr 7, 2026Updated 3 months ago
Facebear-ljx / PROTO
View on GitHub
☆17May 25, 2023Updated 3 years ago
philippe-eecs / IDQL
View on GitHub
Repo for Implicit Diffusion Q-Learning
☆126Dec 5, 2023Updated 2 years ago
pd-perry / TQL
View on GitHub
☆28May 11, 2026Updated 2 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
yudasong / HyQ
View on GitHub
Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
☆24Feb 16, 2023Updated 3 years ago
tongzhoumu / policy_decorator
View on GitHub
Code for "Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model"
☆117Oct 24, 2025Updated 9 months ago
seohongpark / horizon-reduction
View on GitHub
The official implementation of "Horizon Reduction Makes RL Scalable"
☆200Aug 2, 2025Updated 11 months ago
LAMDA-RL / OfflineRL-Lib
View on GitHub
Benchmarked implementations of Offline RL Algorithms.
☆77Mar 4, 2025Updated last year
seohongpark / HIQL
View on GitHub
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆98Dec 1, 2024Updated last year
seohongpark / ogbench
View on GitHub
A benchmark for offline goal-conditioned RL and offline RL
☆441Jan 14, 2026Updated 6 months ago
aviralkumar2907 / CQL
View on GitHub
Code for conservative Q-learning
☆486Dec 7, 2021Updated 4 years ago
dojeon-ai / Atari-PB
View on GitHub
Official repository for "Investigating Pre-Training Objectives for Generalization in Visual Reinforcement Learning" (ICML 2024)
☆11Sep 16, 2025Updated 10 months ago
ryanxhr / IVR
View on GitHub
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆46Jul 27, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
conglu1997 / SynthER
View on GitHub
Synthetic Experience Replay
☆114Apr 16, 2026Updated 3 months ago
yihaosun1124 / OfflineRL-Kit
View on GitHub
An elegant PyTorch offline reinforcement learning library for researchers.
☆393May 2, 2026Updated 2 months ago
ColinQiyangLi / qc
View on GitHub
☆393Feb 5, 2026Updated 5 months ago
seohongpark / fql
View on GitHub
The official implementation of flow Q-learning (FQL)
☆321Jul 21, 2025Updated last year
t6-thu / H2O
View on GitHub
[NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
☆59Sep 24, 2023Updated 2 years ago
Farama-Foundation / D4RL
View on GitHub
A collection of reference environments for offline reinforcement learning
☆1,694Nov 18, 2024Updated last year
jianlanluo / SAQ
View on GitHub
☆34Jun 9, 2025Updated last year