csmile-1006 / PreferenceTransformer

Preference Transformer: Modeling Human Preferences using Transformers for RL (accepted at ICLR 2023)

☆163

Alternatives and similar repositories for PreferenceTransformer

Users who are interested in PreferenceTransformer are comparing it to the libraries listed below.

mxu34 / prompt-dt
Official code repository for Prompt-DT.
☆114 Updated 3 years ago
rll-research / BPref
Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning", containing scripts to reproduce experiments.
☆129 Updated 3 years ago
YaoMarkMu / Awesome-Pretrained-RL
☆89 Updated 2 years ago
conglu1997 / SynthER
Synthetic Experience Replay
☆95 Updated last year
facebookresearch / online-dt
Online Decision Transformer
☆263 Updated last year
srzer / LaMo-2023
Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".
☆53 Updated last year
Haichao-Zhang / PEX
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)
☆59 Updated 2 years ago
ReinholdM / Offline-Pre-trained-Multi-Agent-Decision-Transformer
☆112 Updated 2 years ago
Howuhh / faster-trajectory-transformer
Implementation of Trajectory Transformer with attention caching and batched beam search
☆115 Updated 2 years ago
elicassion / StARformer
[ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.
☆94 Updated 2 years ago
Xingyu-Lin / mbpo_pytorch
A PyTorch replication of the model-based reinforcement learning algorithm MBPO
☆175 Updated 3 years ago
snu-mllab / EDAC
Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
☆75 Updated 2 years ago
Dragon-Zhuang / BPPO
Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).
☆87 Updated last year
jhejna / inverse-preference-learning
☆42 Updated 2 years ago
ryanxhr / POR
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
☆57 Updated 2 years ago
NJU-RL / Meta-DT
[NeurIPS 2024] Official Implementation of Meta-DT
☆45 Updated 9 months ago
danielshin1 / oprl
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20 Updated 2 years ago
ml-jku / L2M
Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)
☆59 Updated 10 months ago
jon--lee / decision-pretrained-transformer
Implementation of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…
☆69 Updated last year
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆146 Updated last year
dmksjfl / MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆58 Updated last year
yuqingd / ellm
☆81 Updated last year
Div99 / XQL
Extreme Q-Learning: Max Entropy RL without Entropy
☆87 Updated 2 years ago
YiqinYang / ICQ
Code accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…
☆74 Updated 2 years ago
etaoxing / multigame-dt
Implementation of Multi-Game Decision Transformers in PyTorch
☆47 Updated 2 years ago
liuzuxin / OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
☆207 Updated 10 months ago
nakamotoo / Cal-QL
Official implementation of the paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
☆104 Updated last year
fuyw / RepL4RL
Representation Learning for RL
☆126 Updated 2 years ago
young-geng / CQL
Conservative Q Learning on top of SAC
☆132 Updated 2 years ago
chwoong / LiRE
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)
☆15 Updated last year