A Library for Active Preference-based Reward Learning Algorithms
☆54Dec 16, 2023Updated 2 years ago
Alternatives and similar repositories for APReL
Users that are interested in APReL are comparing it to the libraries listed below
Sorting:
- Accompanying code for the RSS 2019 paper, "Learning Reward Functions by Integrating Human Demonstrations and Preferences"☆12May 20, 2019Updated 6 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆133Nov 3, 2021Updated 4 years ago
- Companion code to CoRL 2018 paper: E Bıyık, D Sadigh. "Batch Active Preference-Based Learning of Reward Functions". Conference on Robot L…☆30May 29, 2019Updated 6 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Oct 15, 2023Updated 2 years ago
- ☆53Nov 10, 2022Updated 3 years ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆17Jun 18, 2024Updated last year
- ☆37Apr 27, 2023Updated 2 years ago
- ☆13Sep 24, 2024Updated last year
- docker image for reinforcement learning including Open AI roboschool☆13Jun 16, 2019Updated 6 years ago
- my docker images (ROS, SHSA, ..)☆13Nov 18, 2019Updated 6 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 3 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Aug 2, 2018Updated 7 years ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Jul 20, 2024Updated last year
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)☆22Aug 4, 2022Updated 3 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆20Nov 18, 2022Updated 3 years ago
- [CoRL 2020] Learning a natural-language to LTL executable semantic parser for grounded robotics☆16Jul 31, 2022Updated 3 years ago
- ☆21Dec 17, 2020Updated 5 years ago
- Official codebase for Human Guided Exploration (HuGE)☆22Aug 16, 2023Updated 2 years ago
- Implementation of the TAMER algorithm from "Interactively Shaping Agents via Human Reinforcement" (Knox, Stone - 2009)☆21May 6, 2020Updated 5 years ago
- ☆23Oct 31, 2023Updated 2 years ago
- A library to benchmark reinforcement learning algorithms☆21Apr 18, 2018Updated 7 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Sep 26, 2020Updated 5 years ago
- Official codebase for Sirius: Robot Learning on the Job☆61Oct 26, 2023Updated 2 years ago
- SeSaMe TAMP + Learning integrated with a Spot robot!☆28Feb 19, 2026Updated last week
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆28Jan 12, 2023Updated 3 years ago
- Companion Codebase for "No, to the Right – Online Language Corrections for Robotic Manipulation via Shared Autonomy"☆28Dec 13, 2022Updated 3 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.☆27Jun 19, 2016Updated 9 years ago
- Modelling epidemiological dynamics and performing inference in these models☆27Jul 30, 2021Updated 4 years ago
- ☆30Jul 12, 2023Updated 2 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Dec 11, 2020Updated 5 years ago
- Cost-aware Bayesian optimization via the Pandora's box Gittins index☆14Aug 8, 2025Updated 6 months ago
- The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)☆78Dec 5, 2023Updated 2 years ago
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)☆36Oct 19, 2023Updated 2 years ago
- ☆138Feb 26, 2019Updated 7 years ago
- ☆86Apr 10, 2021Updated 4 years ago
- ☆43May 25, 2023Updated 2 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆333Nov 29, 2021Updated 4 years ago