chwoong/LiRE

Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)

☆18

Alternatives and similar repositories for LiRE

Users that are interested in LiRE are comparing it to the libraries listed below.

danielshin1 / oprl
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20 · Dec 30, 2022 · Updated 3 years ago
jhejna / inverse-preference-learning
☆43 · May 25, 2023 · Updated 3 years ago
rll-research / BPref
Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning". Contains scripts to reproduce experiments.
☆136 · Nov 3, 2021 · Updated 4 years ago
csmile-1006 / PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
☆168 · Oct 15, 2023 · Updated 2 years ago
huxiao09 / QPA
☆13 · Sep 24, 2024 · Updated last year
jhejna / few-shot-preference-rl
☆38 · Apr 27, 2023 · Updated 3 years ago
CJReinforce / RIME_ICML2024
Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)
☆36 · Oct 15, 2024 · Updated last year
apple / ml-reed
☆13 · Feb 5, 2024 · Updated 2 years ago
Zzl35 / flow-to-better
☆27 · Apr 22, 2024 · Updated 2 years ago
mschweizer / Pref-RL
Pref-RL provides ready-to-use PbRL agents that are easily extensible.
☆11 · Aug 31, 2022 · Updated 3 years ago
Facebear-ljx / RGM
The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)
☆16 · Mar 3, 2023 · Updated 3 years ago
snu-mllab / DPPO
Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)
☆43 · Jul 20, 2024 · Updated 2 years ago
typoverflow / WiseRL
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21 · Mar 24, 2025 · Updated last year
holken / polite
Code for polite
☆12 · Feb 28, 2024 · Updated 2 years ago
mahaozhe / SASR
[ICLR 2025] Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning (SASR)
☆12 · Aug 26, 2025 · Updated 10 months ago
Guozheng-Ma / Adaptive-Replay-Ratio
[ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.
☆13 · Oct 9, 2024 · Updated last year
NHirose / ExAug
☆11 · Mar 15, 2023 · Updated 3 years ago
dmksjfl / SEABO
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
☆12 · Jan 19, 2024 · Updated 2 years ago
circle-hit / Lens
Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models"
☆12 · Oct 15, 2024 · Updated last year
polixir / morec
☆10 · Mar 11, 2024 · Updated 2 years ago
mansicer / self-verification
☆18 · Dec 23, 2025 · Updated 7 months ago
zzfoutofspace / ATPO
AT2PO: Agentic Turn-based Policy Optimization via Tree Search
☆22 · May 21, 2026 · Updated 2 months ago
solislemuslab / tropical-stethoscope
Classification of animal sounds in a hyperdiverse rainforest using Convolutional Neural Networks (Sun et al, 2021)
☆13 · Oct 16, 2023 · Updated 2 years ago
jmhIcoding / english
Vocabulary memorization
☆11 · Sep 7, 2018 · Updated 7 years ago
bsaund / rviz_voxelgrid_visuals
Plugin and utils for viewing voxelgrids in RViz
☆11 · Jun 14, 2021 · Updated 5 years ago
hgkahng / self-supervised-learning
PyTorch implementations of self-supervised learning algorithms.
☆14 · Jan 14, 2025 · Updated last year
ldery / Bonsai
Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"
☆32 · Mar 28, 2024 · Updated 2 years ago
rperezdattari / Interactive-Learning-of-Temporal-Features-for-Control
Code for the paper "Interactive Learning of Temporal Features for Control", published in the IEEE Robotics & Automation Magazine.
☆12 · Dec 27, 2022 · Updated 3 years ago
lucys0 / awe
Waypoint-Based Imitation Learning for Robotic Manipulation
☆145 · Mar 13, 2024 · Updated 2 years ago
lisir233 / esp_smart_light_controller
☆13 · Dec 3, 2023 · Updated 2 years ago
nissymori / remax-rl
[ICML2026] Official JAX code for Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
☆15 · Jul 3, 2026 · Updated 3 weeks ago
seohongpark / HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆98 · Dec 1, 2024 · Updated last year
ImprintLab / SPA
SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation (ICCV 2025)
☆16 · Sep 26, 2025 · Updated 9 months ago
GeWu-Lab / Action-Preference-Optimization
☆16 · Oct 26, 2025 · Updated 8 months ago
FelipeNuti / diffusion-relative-rewards
Codebase for Extracting Reward Functions from Diffusion Models
☆16 · Dec 7, 2023 · Updated 2 years ago
pokaxpoka / B_Pref
☆54 · Nov 10, 2022 · Updated 3 years ago
dsbrown1331 / bayesianrex
☆21 · Dec 17, 2020 · Updated 5 years ago
ethanluoyc / optimal_transport_reward
☆18 · Apr 11, 2024 · Updated 2 years ago
RyanLiu112 / MRN
[NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learn…
☆26 · Feb 15, 2025 · Updated last year