BY571 / CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.
☆135Updated 10 months ago
Alternatives and similar repositories for CQL:
Users that are interested in CQL are comparing it to the libraries listed below
- A collection of offline reinforcement learning algorithms.☆174Updated 3 months ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆165Updated 2 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆123Updated 7 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆347Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆160Updated 4 months ago
- Code for MOPO: Model-based Offline Policy Optimization☆174Updated 2 years ago
- Conservative Q Learning on top of SAC☆127Updated 2 years ago
- ☆256Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆137Updated last year
- An elegant PyTorch offline reinforcement learning library for researchers.☆310Updated 11 months ago
- ☆193Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆195Updated 6 months ago
- Distributional Soft Actor Critic☆52Updated 4 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆171Updated 7 months ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆82Updated last year
- ☆108Updated last year
- Prioritized Experience Replay implementation with proportional prioritization☆76Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆180Updated 2 years ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆210Updated 4 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 7 months ago
- Code for conservative Q-learning☆426Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆51Updated last year
- PyTorch implementation of Soft Actor-Critic(SAC).☆103Updated 4 years ago
- Model-Based Offline Reinforcement Learning☆48Updated 4 years ago
- Constrained Policy Optimization implementation on Safety Gym☆23Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆137Updated 10 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆101Updated 3 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆65Updated last year
- There will be updates later☆84Updated 5 years ago