mschweizer / Pref-RL
Pref-RL provides ready-to-use PbRL agents that are easily extensible.
☆11Updated 3 years ago
Alternatives and similar repositories for Pref-RL
Users who are interested in Pref-RL are comparing it to the libraries listed below.
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆330Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆212Updated last year
- A PyTorch implementation of Implicit Q-Learning☆86Updated 3 years ago
- ☆43Updated 2 years ago
- ☆284Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆148Updated 2 years ago
- Datasets with baselines for Offline MARL.☆179Updated last month
- An elegant PyTorch offline reinforcement learning library for researchers.☆358Updated 2 months ago
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments.☆130Updated 3 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆373Updated 3 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆189Updated last year
- A collection of offline reinforcement learning algorithms.☆198Updated 10 months ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆104Updated last year
- ☆237Updated 10 months ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆172Updated 10 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆141Updated last year
- ☆201Updated 2 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆179Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆134Updated last year
- Partially Observable Process Gym☆200Updated 3 months ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆76Updated 3 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆70Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆189Updated 3 years ago
- Prioritized Experience Replay implementation with proportional prioritization☆84Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆75Updated 6 months ago
- Model-Based Offline Reinforcement Learning☆51Updated 4 years ago
- Re-implementations of SOTA RL algorithms.☆135Updated 2 years ago
- Conservative Q Learning on top of SAC☆132Updated 2 years ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆16Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR 2023)☆163Updated last year