BY571 / CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.
☆135 · Updated 10 months ago
Alternatives and similar repositories for CQL:
Users who are interested in CQL compare it to the repositories listed below.
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆165 · Updated 2 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆122 · Updated 7 months ago
- A collection of offline reinforcement learning algorithms. ☆174 · Updated 3 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆347 · Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆160 · Updated 4 months ago
- Conservative Q Learning on top of SAC ☆126 · Updated 2 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆195 · Updated 6 months ago
- PyTorch implementation of GAIL and AIRL based on PPO. ☆210 · Updated 4 years ago
- ☆193 · Updated last year
- Code for MOPO: Model-based Offline Policy Optimization ☆174 · Updated 2 years ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆170 · Updated 7 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆136 · Updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆51 · Updated last year
- ☆254 · Updated 3 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆101 · Updated 3 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch. ☆65 · Updated last year
- PyTorch implementation of GAIL, VAIL, AIRL, VAIRL, EAIRL, and SQIL ☆63 · Updated 3 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆82 · Updated last year
- An elegant PyTorch offline reinforcement learning library for researchers. ☆307 · Updated 11 months ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆75 · Updated 2 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning" ☆179 · Updated 2 years ago
- A PyTorch implementation of Implicit Q-Learning ☆74 · Updated 3 years ago
- ☆108 · Updated last year
- Prioritized Experience Replay implementation with proportional prioritization ☆76 · Updated last year
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning ☆160 · Updated 8 months ago
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch ☆148 · Updated 2 years ago
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments. ☆119 · Updated 3 years ago
- Code for conservative Q-learning ☆426 · Updated 3 years ago
- PyTorch implementation of SAC-Discrete. ☆299 · Updated 7 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆169 · Updated 8 months ago