jhejna/inverse-preference-learning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jhejna/inverse-preference-learning)

jhejna / inverse-preference-learning

☆43

Alternatives and similar repositories for inverse-preference-learning

Users that are interested in inverse-preference-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

danielshin1 / oprl
View on GitHub
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20Dec 30, 2022Updated 3 years ago
chwoong / LiRE
View on GitHub
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)
☆18Jun 18, 2024Updated 2 years ago
rll-research / BPref
View on GitHub
Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.
☆136Nov 3, 2021Updated 4 years ago
jhejna / few-shot-preference-rl
View on GitHub
☆38Apr 27, 2023Updated 3 years ago
csmile-1006 / PreferenceTransformer
View on GitHub
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
☆168Oct 15, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
dmksjfl / SEABO
View on GitHub
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
☆12Jan 19, 2024Updated 2 years ago
csmile-1006 / ARP
View on GitHub
Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)
☆33Sep 25, 2023Updated 2 years ago
dwjshift / IL_ADS
View on GitHub
code for the paper Imitation Learning from Observation with Automatic Discount Scheduling
☆13Mar 27, 2024Updated 2 years ago
mit-gfx / PGMORL
View on GitHub
[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
☆133Oct 9, 2020Updated 5 years ago
mansicer / Q-Adapter
View on GitHub
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18Oct 5, 2024Updated last year
ajaysridhar0 / ros_teleop_webapp
View on GitHub
A barebones implementation of remote teleoperation of a ROS-based robot over the internet by using ROSlib
☆17Jan 21, 2023Updated 3 years ago
CU-DitecT / TRC21-PINN-CFM
View on GitHub
☆13Jul 23, 2023Updated 3 years ago
Facebear-ljx / RGM
View on GitHub
The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)
☆16Mar 3, 2023Updated 3 years ago
snu-mllab / DPPO
View on GitHub
Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)
☆43Jul 20, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
typoverflow / WiseRL
View on GitHub
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21Mar 24, 2025Updated last year
kvfrans / matrix-whitening
View on GitHub
Code for "What really matters in matrix-whitening optimizers?"
☆25Oct 31, 2025Updated 8 months ago
Improbable-AI / harness-offline-rl
View on GitHub
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16Feb 14, 2024Updated 2 years ago
pickxiguapi / Uni-RLHF-Platform
View on GitHub
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…
☆42Nov 20, 2024Updated last year
Cloud0723 / Offline-MLIRL
View on GitHub
☆22Dec 18, 2023Updated 2 years ago
ttumiel / minRLHF
View on GitHub
Minimal RLHF implementation built on top of minGPT.
☆32Jul 4, 2024Updated 2 years ago
DongsuLeeTech / AD4RL
View on GitHub
ICRA 2024
☆18Mar 13, 2024Updated 2 years ago
sparkmxy / my-offlinerl
View on GitHub
☆26Jun 14, 2022Updated 4 years ago
LanqingLi1993 / FOCAL-ICLR
View on GitHub
Code for FOCAL Paper Published at ICLR 2021
☆55Dec 4, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
KAIST-AILab / imitation-dice
View on GitHub
☆17Dec 30, 2024Updated last year
OscarHuangWind / Learning-from-Intervention
View on GitHub
[ICRA 2024] Learning from Human Guidance: Uncertainty-aware deep reinforcement learning for autonomous driving.
☆31Feb 22, 2024Updated 2 years ago
Stanford-ILIAD / lilac
View on GitHub
Companion Codebase for "No, to the Right – Online Language Corrections for Robotic Manipulation via Shared Autonomy"
☆28Dec 13, 2022Updated 3 years ago
seohongpark / HIQL
View on GitHub
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆98Dec 1, 2024Updated last year
zpschang / DPMORL
View on GitHub
☆30Feb 26, 2024Updated 2 years ago
JayYang168 / FJSP
View on GitHub
遗传算法求解柔性车间调度问题
☆13Jun 3, 2023Updated 3 years ago
typoverflow / UtilsRL
View on GitHub
A python module designed for agile RL algorithm developing.
☆26Jul 11, 2024Updated 2 years ago
NHirose / ExAug
View on GitHub
☆11Mar 15, 2023Updated 3 years ago
renwang435 / pgr
View on GitHub
Prioritized Generative Replay (ICLR 2025 Oral)
☆29Mar 1, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
junming-yang / mopo
View on GitHub
Model-based Offline Policy Optimization re-implement all by pytorch
☆43Sep 13, 2023Updated 2 years ago
Baichenjia / PBRL
View on GitHub
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆29Feb 21, 2022Updated 4 years ago
morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
GitHubLuCheng / LTEE
View on GitHub
Implementation of paper Long-Term Effect Estimation with Surrogate Representation
☆13Oct 20, 2020Updated 5 years ago
MaxDu17 / BehaviorRetrieval
View on GitHub
Code for the Behavior Retrieval Paper
☆35Jul 24, 2023Updated 3 years ago
ArnaudFickinger / adversarial-surprise
View on GitHub
Explore and Control with Adversarial Surprise
☆10Jul 20, 2021Updated 5 years ago
hello-robot / stretch_web_interface
View on GitHub
Prototype web interface that enables remote teleoperation of the Stretch RE1 mobile manipulator from Hello Robot Inc.
☆12Dec 14, 2023Updated 2 years ago