official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)
☆120Jul 31, 2024Updated last year
Alternatives and similar repositories for Cal-QL
Users that are interested in Cal-QL are comparing it to the libraries listed below
Sorting:
- ☆385Feb 13, 2023Updated 3 years ago
- Conservative Q learning in Jax☆57Feb 7, 2023Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆62Apr 4, 2023Updated 2 years ago
- ☆120Feb 25, 2025Updated last year
- ☆60Feb 3, 2023Updated 3 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆77Jun 23, 2023Updated 2 years ago
- ☆63Jan 30, 2026Updated last month
- A list of Offline to Online RL papers (continually updated)☆71Nov 27, 2025Updated 3 months ago
- Advantage weighted Actor Critic for Offline RL☆52Aug 27, 2022Updated 3 years ago
- High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…☆1,329Aug 3, 2023Updated 2 years ago
- ☆13Sep 24, 2024Updated last year
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆91Nov 4, 2025Updated 3 months ago
- Code for "Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model"☆110Oct 24, 2025Updated 4 months ago
- Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.☆25Feb 16, 2023Updated 3 years ago
- The official implementation of flow Q-learning (FQL)☆281Jul 21, 2025Updated 7 months ago
- ☆317Jan 23, 2022Updated 4 years ago
- Conservative Q Learning on top of SAC☆138Oct 15, 2022Updated 3 years ago
- ☆20May 25, 2023Updated 2 years ago
- JAX implementation of WSRL and RL baselines | ICLR 2025☆132Updated this week
- ☆87Aug 4, 2025Updated 7 months ago
- Code for Scalable Offline Model-Based RL with Action chunking☆18Feb 20, 2026Updated last week
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- The official implementation of "Horizon Reduction Makes RL Scalable"☆181Aug 2, 2025Updated 7 months ago
- Scripts to recreate the D4RL datasets with Minari☆26Jul 21, 2025Updated 7 months ago
- Official repo for Offline RL for Online RL☆19Oct 14, 2023Updated 2 years ago
- Official implementation of DEMO3☆65Jul 29, 2025Updated 7 months ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆28Feb 21, 2022Updated 4 years ago
- Repo for Implicit Diffusion Q-Learning☆123Dec 5, 2023Updated 2 years ago
- Official repository for "Investigating Pre-Training Objectives for Generalization in Visual Reinforcement Learning" (ICML 2024)☆11Sep 16, 2025Updated 5 months ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆47Jul 27, 2023Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆93Dec 1, 2024Updated last year
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)☆38Mar 1, 2021Updated 5 years ago
- Code for conservative Q-learning☆474Dec 7, 2021Updated 4 years ago
- An elegant PyTorch offline reinforcement learning library for researchers.☆383Jul 11, 2025Updated 7 months ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Nov 19, 2023Updated 2 years ago
- Synthetic Experience Replay☆109May 27, 2024Updated last year
- Foundation Policies with Hilbert Representations (ICML 2024)☆105Sep 29, 2025Updated 5 months ago
- A benchmark for offline goal-conditioned RL and offline RL☆339Jan 14, 2026Updated last month