rll-research / BPref
Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.
☆121Updated 3 years ago
Alternatives and similar repositories for BPref:
Users that are interested in BPref are comparing it to the libraries listed below
- ☆47Updated 2 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆75Updated 2 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆67Updated last year
- Conservative Q Learning on top of SAC☆131Updated 2 years ago
- Simple maze environments using mujoco-py☆54Updated last year
- A PyTorch implementation of Implicit Q-Learning☆81Updated 3 years ago
- Official code repository for Prompt-DT.☆109Updated 2 years ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆126Updated last year
- ☆39Updated last year
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆147Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆178Updated 2 years ago
- Model-Based Offline Reinforcement Learning☆51Updated 4 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆100Updated 11 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆80Updated 5 months ago
- ☆55Updated 2 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 2 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆27Updated last year
- Representation Learning for RL☆126Updated 2 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆167Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆54Updated 2 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆126Updated 9 months ago
- ☆265Updated 3 years ago
- ☆26Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆167Updated 5 months ago
- CORRO code☆35Updated 2 years ago
- ☆48Updated last year
- ☆53Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆163Updated last year
- re-implementation of the offline model-based RL algorithm MOPO in pytorch☆24Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆135Updated last year