microsoft / HuRLLinks

Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper

☆15

Alternatives and similar repositories for HuRL

Users that are interested in HuRL are comparing it to the libraries listed below

Sorting:

xingchenwan / bgpbt
[AutoML'22] Bayesian Generational Population-based Training (BG-PBT)
☆28Updated 2 years ago
microsoft / strategically_efficient_rl
More efficient exploration for reinforcement learning in two-player, zero-sum game
☆21Updated 11 months ago
facebookresearch / ssorl
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
☆42Updated 2 years ago
zhxieml / PDT
Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer
☆28Updated last year
CarperAI / Algorithm-Distillation-RLHF
☆34Updated 2 years ago
TrentBrick / RewardConditionedUDRL
Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies
☆18Updated 4 years ago
microsoft / autorl-research
The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.
☆55Updated this week
ReinholdM / Papers-of-Offline-RL
Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
☆18Updated 3 years ago
sfujim / SR-DICE
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆17Updated 3 years ago
ben-eysenbach / info_geometry
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
☆20Updated 3 years ago
KyunghyunLee / aes-rl
☆17Updated 4 years ago
Mehooz / BIRD_code
Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".
☆14Updated 4 years ago
teslacool / m-curl
M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning
☆29Updated 4 years ago
tianjunz / MADE
☆19Updated 4 years ago
google-deepmind / active_ops
☆32Updated 11 months ago
WorldEditors / EvolvingPlasticANN
Codes for Evolving Plastic ANNs
☆13Updated 2 years ago
LunjunZhang / world-model-as-a-graph
Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)
☆66Updated 4 years ago
mansicer / Q-Adapter
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆17Updated 9 months ago
yiqiwang8177 / Official-codebase-for-Decision-Transducer
This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…
☆11Updated last year
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆17Updated 2 years ago
pickxiguapi / Clean-Offline-RLHF
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …
☆38Updated last year
daniellawson9999 / online-decision-transformer
An unofficial implementation for online decision transformer
☆40Updated 2 years ago
adaptive-intelligent-robotics / QDAC
Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …
☆16Updated last year
ml-jku / align-rudder
Code to reproduce results on toy tasks and companion blog for the paper.
☆20Updated 3 years ago
maohangyu / PDiT
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. AAMAS 2024 (full paper with oral presenta…
☆10Updated last year
icaros-usc / dqd-rl
Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"
☆20Updated 2 years ago
Stanford-ILIAD / Conventions-ModularPolicy
PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
☆16Updated 4 years ago
younggyoseo / trajectory_mcl
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
☆39Updated 4 years ago
XanderJC / attention-based-credit
Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…
☆33Updated 11 months ago
ucl-dark / pax
Scalable Opponent Shaping Experiments in JAX
☆24Updated last year