x35f / unstable_baselines
Re-implementations of SOTA RL algorithms.
β131Updated last year
Alternatives and similar repositories for unstable_baselines:
Users that are interested in unstable_baselines are comparing it to the libraries listed below
- Benchmarked implementations of Offline RL Algorithms.β73Updated last month
- π€ Elegant implementations of offline safe RL algorithms in PyTorchβ196Updated 7 months ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPOβ167Updated 3 years ago
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasetsβ117Updated 4 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and coβ¦β135Updated 11 months ago
- β261Updated 3 years ago
- Conservative Q Learning on top of SACβ130Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimizationβ177Updated 2 years ago
- RLA is a tool for managing your RL experiments automaticallyβ71Updated 2 years ago
- β108Updated 2 years ago
- β194Updated 2 years ago
- A PyTorch implementation of Implicit Q-Learningβ77Updated 3 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)β55Updated 11 months ago
- A python module designed for agile RL algorithm developing.β26Updated 9 months ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.β25Updated 2 weeks ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.β121Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RLβ141Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.β163Updated 5 months ago
- A collection of offline reinforcement learning algorithms.β176Updated 4 months ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)β75Updated 2 years ago
- Paper Collection for Batch RL with brief introductions.β85Updated 3 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (teβ¦β166Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"β58Updated 2 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam searchβ111Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).β82Updated last year
- Implementation of Multi-Game Decision Transformers in PyTorchβ46Updated 2 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPSβ¦β73Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTTβ139Updated 11 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"β100Updated 2 years ago
- Datasets with baselines for offline multi-agent reinforcement learning.β162Updated last week