cassidylaidlaw / hidden-context
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆26Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for hidden-context
- Rewarded soups official implementation☆49Updated last year
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆46Updated this week
- Representation Learning in RL☆16Updated 2 years ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆20Updated 2 months ago
- Implements the Messenger environment and EMMA model.☆22Updated last year
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆23Updated 10 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆54Updated 4 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆52Updated 2 months ago
- ☆26Updated last year
- ☆24Updated 6 months ago
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"☆31Updated last year
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆27Updated last year
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning☆14Updated last year
- ☆25Updated last week
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆10Updated 3 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆51Updated last month
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆38Updated 3 months ago
- The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".☆13Updated 4 months ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆15Updated 2 years ago
- Clean, extensible implementation of MACAW [ICML 2021]☆10Updated 2 years ago
- ☆14Updated 9 months ago
- ☆15Updated 3 years ago
- ☆14Updated 4 years ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆14Updated 11 months ago
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆38Updated 9 months ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆26Updated 2 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆32Updated last year
- ☆28Updated last year
- ☆11Updated 2 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆24Updated last month