danielshin1/oprl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/danielshin1/oprl)

danielshin1 / oprl

Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning

☆20

Alternatives and similar repositories for oprl

Users that are interested in oprl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jhejna / inverse-preference-learning
View on GitHub
☆43May 25, 2023Updated 3 years ago
rll-research / BPref
View on GitHub
Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.
☆136Nov 3, 2021Updated 4 years ago
sparkmxy / my-offlinerl
View on GitHub
☆26Jun 14, 2022Updated 4 years ago
catezi / MAPT
View on GitHub
This is the official code repository for the paper "Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Ag…
☆12Apr 9, 2026Updated 3 months ago
mansicer / Q-Adapter
View on GitHub
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18Oct 5, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
microsoft / MAMBA
View on GitHub
Imitation learning from multiple experts
☆13Aug 29, 2022Updated 3 years ago
jhejna / few-shot-preference-rl
View on GitHub
☆38Apr 27, 2023Updated 3 years ago
Facebear-ljx / RGM
View on GitHub
The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)
☆16Mar 3, 2023Updated 3 years ago
typoverflow / WiseRL
View on GitHub
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21Mar 24, 2025Updated last year
rraileanu / policy-dynamics-value-functions
View on GitHub
☆33Aug 30, 2024Updated last year
RyanLiu112 / MRN
View on GitHub
[NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learn…
☆26Feb 15, 2025Updated last year
ttumiel / minRLHF
View on GitHub
Minimal RLHF implementation built on top of minGPT.
☆32Jul 4, 2024Updated 2 years ago
csmile-1006 / PreferenceTransformer
View on GitHub
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
☆168Oct 15, 2023Updated 2 years ago
snu-mllab / DPPO
View on GitHub
Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)
☆43Jul 20, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
LanqingLi1993 / FOCAL-ICLR
View on GitHub
Code for FOCAL Paper Published at ICLR 2021
☆55Dec 4, 2023Updated 2 years ago
pickxiguapi / Uni-RLHF-Platform
View on GitHub
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…
☆42Nov 20, 2024Updated last year
jhejna / cpl
View on GitHub
Code for Contrastive Preference Learning (CPL)
☆184Nov 22, 2024Updated last year
XuGW-Kevin / DrM
View on GitHub
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …
☆78Feb 19, 2026Updated 5 months ago
microsoft / autorl-research
View on GitHub
The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.
☆63Jul 22, 2025Updated last year
KAIST-AILab / imitation-dice
View on GitHub
☆17Dec 30, 2024Updated last year
mahaozhe / SASR
View on GitHub
[ICLR 2025] Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning (SASR)
☆12Aug 26, 2025Updated 11 months ago
mumu12641 / strawberry
View on GitHub
🍓 A toy object-oriented programming language written by rust
☆17Apr 10, 2024Updated 2 years ago
WorldEditors / EvolvingPlasticANN
View on GitHub
Codes for Evolving Plastic ANNs
☆15Dec 18, 2022Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
instadeepai / outer-value-function-meta-rl
View on GitHub
Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
☆13Apr 13, 2026Updated 3 months ago
ldcq / ldcq
View on GitHub
☆35May 24, 2023Updated 3 years ago
CJReinforce / RIME_ICML2024
View on GitHub
Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)
☆36Oct 15, 2024Updated last year
typoverflow / UtilsRL
View on GitHub
A python module designed for agile RL algorithm developing.
☆26Jul 11, 2024Updated 2 years ago
donglixp / ICL_PaperList
View on GitHub
Paper List for In-context Learning 🌷
☆19Jan 3, 2023Updated 3 years ago
Guozheng-Ma / Adaptive-Replay-Ratio
View on GitHub
[ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.
☆13Oct 9, 2024Updated last year
junming-yang / mopo
View on GitHub
Model-based Offline Policy Optimization re-implement all by pytorch
☆43Sep 13, 2023Updated 2 years ago
Baichenjia / PBRL
View on GitHub
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆29Feb 21, 2022Updated 4 years ago
dmksjfl / SEABO
View on GitHub
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
☆12Jan 19, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
yihaosun1124 / OfflineRL-Kit
View on GitHub
An elegant PyTorch offline reinforcement learning library for researchers.
☆393May 2, 2026Updated 2 months ago
polixir / OfflineRL
View on GitHub
A collection of offline reinforcement learning algorithms.
☆211Nov 26, 2024Updated last year
ikostrikov / implicit_q_learning
View on GitHub
☆330Jan 23, 2022Updated 4 years ago
amujika / Open-Ended-Reinforcement-Learning-with-Neural-Reward-Functions
View on GitHub
☆14Oct 11, 2022Updated 3 years ago
niudong1001 / learn-ai
View on GitHub
存储在学习人工智能（AI）中涉及到的各种基础知识，工具，模型，算法，代码等。
☆14Mar 10, 2019Updated 7 years ago
maxjcohen / vqvae
View on GitHub
VQ-VAE implementation in pytorch, supporting EMA and Gumbel trainings. Applicable for images and time series.
☆11Oct 19, 2022Updated 3 years ago