Pref-RL provides ready-to-use, easily extensible agents for preference-based reinforcement learning (PbRL).
☆11 · Aug 31, 2022 · Updated 3 years ago
Alternatives and similar repositories for Pref-RL
Users interested in Pref-RL are comparing it to the libraries listed below.
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms ☆21 · Mar 24, 2025 · Updated 11 months ago
- Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning ☆15 · May 26, 2022 · Updated 3 years ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024) ☆17 · Jun 18, 2024 · Updated last year
- ☆11 · Feb 25, 2025 · Updated last year
- (This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for efficientl… ☆29 · Jan 22, 2019 · Updated 7 years ago
- The source code of the paper "Towards Problem of First Miss under Mobile EdgeCaching" ☆11 · Apr 12, 2021 · Updated 4 years ago
- code for "Decoupled Preference-based Reinforcement Learning for Personalized Human-Robot Interaction" ☆11 · Jul 9, 2022 · Updated 3 years ago
- A wrapper for paper-dropdown-menu to enable various features like multi-select, search / filter of items, key value pair and 2-way bindin… ☆16 · Sep 25, 2019 · Updated 6 years ago
- ☆21 · Apr 23, 2025 · Updated 11 months ago
- ☆43 · May 25, 2023 · Updated 2 years ago
- Benchmark environments for reward modelling and imitation learning algorithms. ☆46 · Sep 19, 2023 · Updated 2 years ago
- ☆19 · Jan 28, 2024 · Updated 2 years ago
- code for polite ☆11 · Feb 28, 2024 · Updated 2 years ago
- SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation (ICCV 2025) ☆15 · Sep 26, 2025 · Updated 5 months ago
- ☆13 · Feb 5, 2024 · Updated 2 years ago
- Open AI Gym version of Berkeley AI Pacman with images as states ☆13 · May 4, 2018 · Updated 7 years ago
- ☆22 · Jul 18, 2024 · Updated last year
- Context A real online retail transaction data set of two years. Content This Online Retail II data set contains all the transactions oc… ☆18 · Jul 5, 2020 · Updated 5 years ago
- Text files to help plan & log whatever it is you do. Bullet journal + pomodoro technique + text editors + cloud syncing = progress. ☆15 · Aug 7, 2021 · Updated 4 years ago
- ☆14 · Mar 5, 2024 · Updated 2 years ago
- Polymer element that wraps Swiper.js ☆22 · Apr 16, 2023 · Updated 2 years ago
- Code for the content caching algorithm in edge caching. ☆22 · Sep 24, 2024 · Updated last year
- A gulp plugin for wrapping css into style modules as used by Polymer ☆24 · Nov 16, 2016 · Updated 9 years ago
- implementation of cooperative caching algorithm for edge computing ☆17 · May 9, 2023 · Updated 2 years ago
- Deep learning based predictive analytics for efficient content caching in edge network ☆18 · Dec 26, 2022 · Updated 3 years ago
- ☆10 · Aug 8, 2021 · Updated 4 years ago
- Word memorization ☆10 · Sep 7, 2018 · Updated 7 years ago
- OCTCube-M: A 3D multimodal optical coherence tomography foundation model for retinal and systemic diseases with cross-cohort and cross-de… ☆22 · Jun 30, 2025 · Updated 8 months ago
- ☆14 · Apr 24, 2023 · Updated 2 years ago
- Classification of animal sounds in a hyperdiverse rainforest using Convolutional Neural Networks (Sun et al, 2021) ☆13 · Oct 16, 2023 · Updated 2 years ago
- RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback ☆13 · Updated this week
- This repository contains code for the paper: S Bergsma, T Zeyl, JR Anaraki, L Guo, C2FAR: Coarse-to-Fine Autoregressive Networks for Prec… ☆13 · Dec 7, 2023 · Updated 2 years ago
- ☆10 · Oct 15, 2020 · Updated 5 years ago
- A transparent, single-file implementation for understanding GRPO (K1 in Rewards), free from the abstractions of large libraries. ☆47 · Feb 14, 2026 · Updated last month
- Region Encoder Network ☆18 · Oct 2, 2025 · Updated 5 months ago
- ☆12 · Sep 13, 2024 · Updated last year
- A lightweight RL environment for query optimization. ☆16 · Sep 13, 2024 · Updated last year
- ☆13 · Sep 24, 2024 · Updated last year
- ☆14 · Oct 8, 2023 · Updated 2 years ago