Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning", containing scripts to reproduce the experiments.
☆133 Nov 3, 2021 Updated 4 years ago
Alternatives and similar repositories for BPref
Users that are interested in BPref are comparing it to the libraries listed below
- ☆53 Nov 10, 2022 Updated 3 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted) ☆167 Oct 15, 2023 Updated 2 years ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024) ☆17 Jun 18, 2024 Updated last year
- ☆37 Apr 27, 2023 Updated 2 years ago
- ☆43 May 25, 2023 Updated 2 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning ☆20 Dec 30, 2022 Updated 3 years ago
- ☆18 Jun 8, 2023 Updated 2 years ago
- A Library for Active Preference-based Reward Learning Algorithms ☆54 Dec 16, 2023 Updated 2 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted) ☆33 Sep 25, 2023 Updated 2 years ago
- [NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learn… ☆26 Feb 15, 2025 Updated last year
- ☆14 Oct 11, 2022 Updated 3 years ago
- Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat… ☆16 Jun 28, 2023 Updated 2 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023 ☆33 Dec 7, 2024 Updated last year
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020) ☆39 Oct 27, 2020 Updated 5 years ago
- Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation" ☆19 Jul 11, 2023 Updated 2 years ago
- ☆60 Apr 16, 2023 Updated 2 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU. ☆18 Jan 16, 2023 Updated 3 years ago
- Code for Contrastive Preference Learning (CPL) ☆179 Nov 22, 2024 Updated last year
- PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023). ☆20 Mar 4, 2023 Updated 3 years ago
- ☆317 Jan 23, 2022 Updated 4 years ago
- Pref-RL provides ready-to-use PbRL agents that are easily extensible. ☆11 Aug 31, 2022 Updated 3 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation ☆29 Feb 22, 2023 Updated 3 years ago
- ☆360 Oct 12, 2022 Updated 3 years ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML… ☆28 Jan 12, 2023 Updated 3 years ago
- Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration ☆53 Nov 8, 2021 Updated 4 years ago
- ☆60 Feb 3, 2023 Updated 3 years ago
- ☆13 Sep 24, 2024 Updated last year
- A collection of reference environments for offline reinforcement learning ☆1,656 Nov 18, 2024 Updated last year
- Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning ☆1,749 Jan 20, 2026 Updated last month
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆93 Dec 1, 2024 Updated last year
- Multi-Objective Reinforcement Learning ☆296 Aug 10, 2021 Updated 4 years ago
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code ☆17 Aug 23, 2024 Updated last year
- Simulation environments for Multi-Objective Reinforcement Learning (MORL) ☆17 Aug 2, 2022 Updated 3 years ago
- [ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control ☆125 Oct 9, 2020 Updated 5 years ago
- ☆32 Mar 10, 2024 Updated last year
- Representation Learning in RL ☆13 Jun 1, 2022 Updated 3 years ago
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL) ☆18 Apr 21, 2022 Updated 3 years ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023) ☆16 Mar 3, 2023 Updated 3 years ago
- ☆30 Jun 4, 2022 Updated 3 years ago