cassidylaidlaw / hidden-context
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆27 · Updated 11 months ago
Related projects
Alternatives and complementary repositories for hidden-context
- Rewarded soups official implementation ☆51 · Updated last year
- Implements the Messenger environment and EMMA model. ☆23 · Updated last year
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆54 · Updated 4 months ago
- Representation Learning in RL ☆16 · Updated 2 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling ☆15 · Updated 2 years ago
- ☆28 · Updated last year
- Clean, extensible implementation of MACAW [ICML 2021] ☆11 · Updated 2 years ago
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer ☆27 · Updated last year
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning ☆14 · Updated last year
- ☆25 · Updated 3 weeks ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR 2022) ☆66 · Updated 2 years ago
- ☆15 · Updated 3 years ago
- Implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies ☆10 · Updated 3 years ago
- ☆11 · Updated 2 years ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt… ☆20 · Updated 3 months ago
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment" ☆48 · Updated 2 weeks ago
- Code for Model-Free Opponent Shaping (ICML 2022) ☆16 · Updated 2 years ago
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline" ☆32 · Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆52 · Updated last month
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 · Updated 2 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning ☆48 · Updated 3 years ago
- ☆24 · Updated 7 months ago
- ☆36 · Updated last year
- ☆26 · Updated last year
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted) ☆32 · Updated last year
- Generalised UDRL ☆37 · Updated 2 years ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023) ☆30 · Updated last year
- ☆29 · Updated 3 years ago
- Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR 2023 ☆28 · Updated last year
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021 ☆16 · Updated 3 years ago