mschweizer/Pref-RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mschweizer/Pref-RL)

mschweizer / Pref-RL

Pref-RL provides ready-to-use PbRL agents that are easily extensible.

☆11

Alternatives and similar repositories for Pref-RL

Users that are interested in Pref-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

typoverflow / WiseRL
View on GitHub
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21Mar 24, 2025Updated last year
rll-research / rune
View on GitHub
Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
☆15May 26, 2022Updated 4 years ago
chwoong / LiRE
View on GitHub
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)
☆18Jun 18, 2024Updated 2 years ago
Tony-Tan / Reinforcement-Learning
View on GitHub
☆11Feb 25, 2025Updated last year
machine-intelligence / rl-teacher-atari
View on GitHub
(This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for efficientl…
☆29Jan 22, 2019Updated 7 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
YanpengLuo / Towards-Problem-of-First-Miss-under-Mobile-EdgeCaching
View on GitHub
The source code of the paper "Towards Problem of First Miss under Mobile EdgeCaching"
☆11Apr 12, 2021Updated 5 years ago
dannigt / mid-align
View on GitHub
☆15Sep 30, 2025Updated 9 months ago
Wenminggong / PbRL_for_PHRI
View on GitHub
code for "Decoupled Preference-based Reinforcement Learning for Personalized Human-Robot Interaction"
☆11Jul 9, 2022Updated 4 years ago
pushkar8723 / paper-dropdown
View on GitHub
A wrapper for paper-dropdown-menu to enable various features like multi-select, search / filter of items, key value pair and 2-way bindin…
☆16Sep 25, 2019Updated 6 years ago
zehao-wang / iPPD-sem
View on GitHub
☆21Apr 23, 2025Updated last year
HumanCompatibleAI / seals
View on GitHub
Benchmark environments for reward modelling and imitation learning algorithms.
☆47Sep 19, 2023Updated 2 years ago
mathchi / Customer-Segmentation-with-RFM-Analysis
View on GitHub
Context A real online retail transaction data set of two years. Content This Online Retail II data set contains all the transactions oc…
☆18Jul 5, 2020Updated 6 years ago
jhejna / inverse-preference-learning
View on GitHub
☆43May 25, 2023Updated 3 years ago
callummcdougall / TransformerLens-intro
View on GitHub
☆20Jan 28, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
holken / polite
View on GitHub
code for polite
☆12Feb 28, 2024Updated 2 years ago
apple / ml-reed
View on GitHub
☆13Feb 5, 2024Updated 2 years ago
DarriusL / CoCheLab
View on GitHub
Code for the content caching algorithm in edge caching.
☆22Sep 24, 2024Updated last year
sohamghosh121 / PacmanGym
View on GitHub
Open AI Gym version of Berkeley AI Pacman with images as states
☆13May 4, 2018Updated 8 years ago
acsresearch / interlab
View on GitHub
☆22Jul 18, 2024Updated 2 years ago
morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
ryan-p-randall / monthly-planning-files
View on GitHub
Text files to help plan & log whatever it is you do. Bullet journal + pomodoro technique + text editors + cloud syncing = progress.
☆16Aug 7, 2021Updated 4 years ago
capptions / iron-swiper
View on GitHub
Polymer element that wraps Swiper.js
☆22Apr 16, 2023Updated 3 years ago
MaKleSoft / gulp-style-modules
View on GitHub
A gulp plugin for wrapping css into style modules as used by Polymer
☆24Nov 16, 2016Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
emylincon / caching_project
View on GitHub
implementation of cooperative caching algorithm for edge computing
☆17May 9, 2023Updated 3 years ago
DhrubaDC1 / deep-learning-caching
View on GitHub
Deep learning based predictive analytics for efficient content caching in edge network
☆18Dec 26, 2022Updated 3 years ago
winkyao / join-order-benchmark
View on GitHub
☆14Apr 24, 2023Updated 3 years ago
solislemuslab / tropical-stethoscope
View on GitHub
Classification of animal sounds in a hyperdiverse rainforest using Convolutional Neural Networks (Sun et al, 2021)
☆13Oct 16, 2023Updated 2 years ago
huaweicloud / c2far_forecasting
View on GitHub
This repository contains code for the paper: S Bergsma, T Zeyl, JR Anaraki, L Guo, C2FAR: Coarse-to-Fine Autoregressive Networks for Prec…
☆13Dec 7, 2023Updated 2 years ago
ymetz / rlhfblender
View on GitHub
RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback
☆14May 19, 2026Updated 2 months ago
anishmadan23 / MAML_Pytorch_RL
View on GitHub
☆10Aug 8, 2021Updated 4 years ago
jmhIcoding / english
View on GitHub
单词记忆
☆11Sep 7, 2018Updated 7 years ago
tianxusky / Code-for-Error-Bounds-of-Imitating-Policies-and-Environments
View on GitHub
☆10Oct 15, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yunjiazhang / adaptiveness_vs_learning
View on GitHub
☆12Sep 13, 2024Updated last year
ImprintLab / SPA
View on GitHub
SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation (ICCV 2025)
☆16Sep 26, 2025Updated 9 months ago
kaiwenw / JoinGym
View on GitHub
A lightweight RL environment for query optimization.
☆16Sep 13, 2024Updated last year
thejaminator / latteries
View on GitHub
James' cookbook of evaluations and finetuning experiments
☆32Feb 19, 2026Updated 5 months ago
huxiao09 / QPA
View on GitHub
☆13Sep 24, 2024Updated last year
qw4990 / index_advisor
View on GitHub
☆14Oct 8, 2023Updated 2 years ago
salubinseid / mobility-aware-caching-iov-icn
View on GitHub
Mobility-Aware Proactive Edge Caching OptimizationScheme in Information-Centric IoV Networks
☆21Jan 20, 2022Updated 4 years ago