Conservative Q Learning on top of SAC
☆138Oct 15, 2022Updated 3 years ago
Alternatives and similar repositories for CQL
Users that are interested in CQL are comparing it to the libraries listed below
Sorting:
- Conservative Q learning in Jax☆57Feb 7, 2023Updated 3 years ago
- Code for conservative Q-learning☆472Dec 7, 2021Updated 4 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆393Dec 18, 2021Updated 4 years ago
- ☆317Jan 23, 2022Updated 4 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆147May 6, 2024Updated last year
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆79Aug 14, 2022Updated 3 years ago
- A PyTorch implementation of Implicit Q-Learning☆95Oct 23, 2021Updated 4 years ago
- Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)☆12Jul 4, 2022Updated 3 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆29Feb 21, 2022Updated 4 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆191May 17, 2022Updated 3 years ago
- An offline deep reinforcement learning library☆1,645Sep 10, 2025Updated 5 months ago
- ☆202Mar 25, 2023Updated 2 years ago
- ☆60Feb 3, 2023Updated 3 years ago
- GPT implementation in Flax☆18Jan 8, 2022Updated 4 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆47Jul 27, 2023Updated 2 years ago
- A simple and easy to use implementation of the soft actor-critic algorithm.☆15Sep 2, 2022Updated 3 years ago
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.☆752Oct 26, 2022Updated 3 years ago
- A collection of reference environments for offline reinforcement learning☆1,649Nov 18, 2024Updated last year
- An index of algorithms for offline reinforcement learning (offline-rl)☆1,052May 23, 2024Updated last year
- [ICRA'25] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps☆12Apr 10, 2025Updated 10 months ago
- MuJoCo models for Unitree Robots☆12Nov 24, 2021Updated 4 years ago
- High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…☆1,328Aug 3, 2023Updated 2 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆120Jul 31, 2024Updated last year
- Advantage weighted Actor Critic for Offline RL☆52Aug 27, 2022Updated 3 years ago
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"☆119Feb 11, 2025Updated last year
- ☆385Feb 13, 2023Updated 3 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Apr 6, 2023Updated 2 years ago
- A collection of offline reinforcement learning algorithms.☆208Nov 26, 2024Updated last year
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Oct 25, 2021Updated 4 years ago
- This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni…☆14Feb 27, 2023Updated 3 years ago
- ☆80Dec 9, 2022Updated 3 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- Code for RRL (https://sites.google.com/view/abstractions4rl)☆27Jan 21, 2022Updated 4 years ago
- code for CoRL 2020 paper "Contrastive Variational Model-Based Reinforcement Learning for Complex Observations"☆24Dec 29, 2021Updated 4 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆26Jan 16, 2023Updated 3 years ago
- ☆26Jun 14, 2022Updated 3 years ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆16Mar 3, 2023Updated 3 years ago
- ☆20May 25, 2023Updated 2 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆53Oct 18, 2021Updated 4 years ago