pokaxpoka/B_Pref

☆53

Alternatives and similar repositories for B_Pref

Users who are interested in B_Pref are comparing it to the libraries listed below.

rll-research / BPref
Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments.
☆133 · Nov 3, 2021 · Updated 4 years ago
jhejna / few-shot-preference-rl
☆37 · Apr 27, 2023 · Updated 2 years ago
rll-research / rune
Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
☆15 · May 26, 2022 · Updated 3 years ago
csmile-1006 / PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR 2023 Accepted)
☆167 · Oct 15, 2023 · Updated 2 years ago
Miffyli / rl-human-prior-tricks
Evaluating different engineering tricks that make RL work
☆15 · Jun 3, 2021 · Updated 4 years ago
csmile-1006 / ARP
Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)
☆33 · Sep 25, 2023 · Updated 2 years ago
lili-chen / SEER
Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.
☆21 · Mar 5, 2021 · Updated 5 years ago
mila-iqia / SGI
Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)
☆55 · Jul 27, 2021 · Updated 4 years ago
younggyoseo / trajectory_mcl
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
☆39 · Oct 27, 2020 · Updated 5 years ago
flowersteam / EAGER
☆10 · Oct 11, 2022 · Updated 3 years ago
Manchery / iql-pytorch
Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL
☆24 · Nov 4, 2024 · Updated last year
Pearl-UTexas / EMPATHIC
☆10 · Oct 3, 2023 · Updated 2 years ago
WorldEditors / EvolvingPlasticANN
Code for Evolving Plastic ANNs
☆14 · Dec 18, 2022 · Updated 3 years ago
zhxieml / PDT
Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer
☆29 · Jul 25, 2023 · Updated 2 years ago
valeriechen / ask-your-humans
Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"
☆11 · Apr 8, 2025 · Updated 10 months ago
rll-research / teachable
☆17 · Oct 12, 2023 · Updated 2 years ago
apple / ml-reed
☆13 · Feb 5, 2024 · Updated 2 years ago
anair13 / bullet-manipulation-affordances
☆13 · Jun 3, 2022 · Updated 3 years ago
uoe-agents / derl
The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)
☆27 · Feb 3, 2022 · Updated 4 years ago
HumanCompatibleAI / learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆31 · Jul 27, 2021 · Updated 4 years ago
laurimi / multiagent-prediction-reward
Multi-agent active perception with prediction rewards
☆11 · Nov 13, 2020 · Updated 5 years ago
muttimirco / mepol
Implementation of the MEPOL algorithm - A policy gradient method for task-agnostic exploration
☆15 · Jul 6, 2023 · Updated 2 years ago
yiqiwang8177 / Official-codebase-for-Decision-Transducer
This is the PyTorch implementation of the UAI 2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…
☆11 · Oct 9, 2023 · Updated 2 years ago
mrahtz / learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆333 · Nov 29, 2021 · Updated 4 years ago
Stanford-ILIAD / APReL
A Library for Active Preference-based Reward Learning Algorithms
☆54 · Dec 16, 2023 · Updated 2 years ago
hari-sikchi / AWAC
Advantage Weighted Actor Critic for Offline RL
☆52 · Aug 27, 2022 · Updated 3 years ago
ShuangLI59 / Pre-Trained-Language-Models-for-Interactive-Decision-Making
Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]
☆130 · Jun 8, 2022 · Updated 3 years ago
minnesotanlp / infoVerse
Jaehyung Kim et al.'s ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…
☆16 · Jun 28, 2023 · Updated 2 years ago
chwoong / LiRE
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)
☆17 · Jun 18, 2024 · Updated last year
yuqingd / cusp
☆15 · Sep 7, 2022 · Updated 3 years ago
yuqingd / sim2real2sim_rad
☆58 · Jun 30, 2022 · Updated 3 years ago
younggyoseo / MV-MWM
☆60 · Apr 16, 2023 · Updated 2 years ago
LunjunZhang / world-model-as-a-graph
Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)
☆68 · Jul 17, 2021 · Updated 4 years ago
shlee94 / Off2OnRL
☆60 · Feb 3, 2023 · Updated 3 years ago
siddharthverma314 / clcp-neurips-2020
Code for Continual Learning of Control Primitives
☆18 · Nov 11, 2020 · Updated 5 years ago
Mehooz / BIRD_code
Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".
☆14 · May 23, 2021 · Updated 4 years ago
akakzia / decstr
☆15 · Aug 9, 2021 · Updated 4 years ago
david-lindner / idrl
Code accompanying the paper "Information Directed Reward Learning for Reinforcement Learning" (NeurIPS 2021).
☆13 · Nov 16, 2021 · Updated 4 years ago
kingdy2002 / VCSE
☆18 · Jun 8, 2023 · Updated 2 years ago