watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆168 Updated 6 months ago
Alternatives and similar repositories for REDQ
Users who are interested in REDQ are comparing it to the libraries listed below.
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆168 Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆128 Updated 10 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆361 Updated 3 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆133 Updated 10 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆143 Updated last year
- Code for MOPO: Model-based Offline Policy Optimization ☆179 Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆137 Updated last year
- ☆196 Updated 2 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch. ☆68 Updated last year
- DSAC; Distributional Soft Actor-Critic ☆126 Updated 3 months ago
- PyTorch implementation of GAIL and AIRL based on PPO. ☆223 Updated 4 years ago
- Conservative Q Learning on top of SAC ☆130 Updated 2 years ago
- PyTorch implementation of Dreamer (Model-based Image RL Algorithm) ☆166 Updated 4 months ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆205 Updated 8 months ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder (SAC+AE) ☆239 Updated 5 years ago
- A collection of offline reinforcement learning algorithms. ☆185 Updated 6 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆110 Updated 4 years ago
- Distributional Soft Actor Critic ☆53 Updated 4 years ago
- ☆270 Updated 3 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆179 Updated 11 months ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆96 Updated 4 years ago
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆189 Updated 2 years ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning ☆92 Updated 8 months ago
- There will be updates later ☆84 Updated 6 years ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆145 Updated last year
- Prioritized Experience Replay implementation with proportional prioritization ☆78 Updated last year
- PyTorch implementation of FQF, IQN and QR-DQN. ☆177 Updated 10 months ago
- Constrained Policy Optimization implementation on Safety Gym ☆27 Updated 3 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL… ☆288 Updated 4 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆66 Updated 2 years ago