polixir / OfflineRL
A collection of offline reinforcement learning algorithms.
☆174Updated 3 months ago
Alternatives and similar repositories for OfflineRL:
Users who are interested in OfflineRL are comparing it to the libraries listed below:
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆132Updated 10 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆164Updated last year
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆347Updated 3 years ago
- An elegant PyTorch offline reinforcement learning library for researchers.☆307Updated 10 months ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆210Updated 4 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆122Updated 7 months ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆165Updated 2 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆194Updated 5 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆161Updated 10 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆100Updated 3 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆82Updated last year
- ☆108Updated last year
- Pytorch GAIL VAIL AIRL VAIRL EAIRL SQIL Implementation☆63Updated 3 years ago
- Constrained Policy Optimization implementation on Safety Gym☆23Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆51Updated last year
- Implementations of various DQN reinforcement learning algorithms in PyTorch☆94Updated 3 years ago
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets☆115Updated 3 months ago
- ☆196Updated last year
- A plotter for reinforcement learning (RL)☆218Updated 3 years ago
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆73Updated 2 months ago
- DSAC; Distributional Soft Actor-Critic☆125Updated last month
- Code for MOPO: Model-based Offline Policy Optimization☆173Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆160Updated 3 months ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆75Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆134Updated last year
- Code for Weighted QMIX☆130Updated 4 years ago
- PyTorch implementation of Constrained Policy Optimization☆52Updated 3 years ago
- This is the official implementation of Multi-Agent PPO.☆105Updated 2 years ago
- Conservative Q Learning on top of SAC☆126Updated 2 years ago
- A full PyTorch re-implementation of Model-based Offline Policy Optimization☆31Updated last year