danielshin1 / oprlView external linksLinks
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20Dec 30, 2022Updated 3 years ago
Alternatives and similar repositories for oprl
Users that are interested in oprl are comparing it to the libraries listed below
Sorting:
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆17Jun 18, 2024Updated last year
- ☆43May 25, 2023Updated 2 years ago
- ☆26Jun 14, 2022Updated 3 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆133Nov 3, 2021Updated 4 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated 10 months ago
- This is the official code repository for the paper "Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Ag…☆11Feb 6, 2025Updated last year
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Jul 4, 2024Updated last year
- ☆37Apr 27, 2023Updated 2 years ago
- ☆33Aug 30, 2024Updated last year
- Flow RL is a high-performance RL library with flow and diffusion models.☆27Feb 6, 2026Updated last week
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆16Mar 3, 2023Updated 2 years ago
- ☆18Apr 11, 2024Updated last year
- ☆20Mar 19, 2024Updated last year
- Public implementation of "Encoding Human Domain Knowledge to Warm Start Reinforcement Learning" from AAAI'21☆20Mar 5, 2024Updated last year
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Jul 20, 2024Updated last year
- ☆17Dec 30, 2024Updated last year
- Paper List for In-context Learning 🌷☆20Jan 3, 2023Updated 3 years ago
- [NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learn…☆26Feb 15, 2025Updated last year
- Featurized Density Ratio Estimation☆20Jul 11, 2021Updated 4 years ago
- Code for FOCAL Paper Published at ICLR 2021☆55Dec 4, 2023Updated 2 years ago
- The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.☆61Jul 22, 2025Updated 6 months ago
- Standalone library of frequently-used wrappers for dm_env environments.☆18Jul 9, 2024Updated last year
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆22Apr 17, 2024Updated last year
- Inverse Constrained Reinforcement Learning (ICML 2021)☆25Aug 18, 2021Updated 4 years ago
- A python module designed for agile RL algorithm developing.☆26Jul 11, 2024Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Oct 15, 2023Updated 2 years ago
- Code for Contrastive Preference Learning (CPL)☆178Nov 22, 2024Updated last year
- Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase.☆35Oct 7, 2024Updated last year
- ☆31Aug 25, 2022Updated 3 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆29Feb 21, 2022Updated 3 years ago
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implement☆10Feb 19, 2024Updated last year
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 2 years ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆26Jan 27, 2026Updated 2 weeks ago
- Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)☆34Oct 15, 2024Updated last year
- Model-based Offline Policy Optimization re-implement all by pytorch☆39Sep 13, 2023Updated 2 years ago
- Implementation of Sim2Seg (John So*, Amber Xie*, Sunggoo Jung, Jeffrey Edlund, Rohan Thakker, Ali-akbar Agha-mohammad, Pieter Abbeel, Ste…☆36Aug 26, 2023Updated 2 years ago