Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)
☆17Jun 18, 2024Updated last year
Alternatives and similar repositories for LiRE
Users that are interested in LiRE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆43May 25, 2023Updated 2 years ago
- Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)☆36Oct 15, 2024Updated last year
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆134Nov 3, 2021Updated 4 years ago
- ☆13Feb 5, 2024Updated 2 years ago
- This is the official code repository for the paper "Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Ag…☆12Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Oct 15, 2023Updated 2 years ago
- Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning☆15May 26, 2022Updated 3 years ago
- Pref-RL provides ready-to-use PbRL agents that are easily extensible.☆11Aug 31, 2022Updated 3 years ago
- Code for the Behavior Retrieval Paper☆35Jul 24, 2023Updated 2 years ago
- ☆37Apr 27, 2023Updated 2 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated last year
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆79Feb 19, 2026Updated last month
- 🍓 A toy object-oriented programming language written by rust☆17Apr 10, 2024Updated 2 years ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆11Mar 15, 2023Updated 3 years ago
- SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation (ICCV 2025)☆15Sep 26, 2025Updated 6 months ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Jan 19, 2024Updated 2 years ago
- 2023 ABCI Llama-2 継続学習プロジェクト☆14Jan 22, 2024Updated 2 years ago
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆13Oct 9, 2024Updated last year
- Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models"☆11Oct 15, 2024Updated last year
- ☆10Mar 11, 2024Updated 2 years ago
- ☆14Oct 11, 2022Updated 3 years ago
- code for the paper Imitation Learning from Observation with Automatic Discount Scheduling☆13Mar 27, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Classification of animal sounds in a hyperdiverse rainforest using Convolutional Neural Networks (Sun et al, 2021)☆13Oct 16, 2023Updated 2 years ago
- RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback☆13Mar 23, 2026Updated 3 weeks ago
- 音声を文字起こししてChatGPTと会話したい☆22Mar 8, 2023Updated 3 years ago
- ☆11Apr 22, 2022Updated 3 years ago
- Source code for the IROS21 paper Efficient Task Planning for Mobile Manipulation: a Virtual Kinematic Chain Perspective☆11Aug 2, 2021Updated 4 years ago
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆24Oct 11, 2025Updated 6 months ago
- python implementation of the Viola-Jones algorithm in rapid face detection☆11Dec 3, 2018Updated 7 years ago
- Pluggin and utils for viewing voxelgrids in RViz☆11Jun 14, 2021Updated 4 years ago
- 哔哩哔哩常用API调用。☆17Aug 5, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Waypoint-Based Imitation Learning for Robotic Manipulation☆140Mar 13, 2024Updated 2 years ago
- ☆13Dec 3, 2023Updated 2 years ago
- [ICML 2024] Learning Reward for Robot Skills Using Large Language Models via Self-Alignment☆18Aug 22, 2024Updated last year
- Codebase for Extracting Reward Functions from Diffusion Models☆16Dec 7, 2023Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆95Dec 1, 2024Updated last year
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"☆30Mar 28, 2024Updated 2 years ago
- ☆16Apr 2, 2025Updated last year