WEIRDLabUW / vpl_llmLinks
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆22Updated last year
Alternatives and similar repositories for vpl_llm
Users that are interested in vpl_llm are comparing it to the libraries listed below
Sorting:
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆77Updated 4 months ago
- A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.☆44Updated 9 months ago
- Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning (ICML 2024)☆17Updated last year
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆92Updated last year
- Direct preference optimization with f-divergences.☆14Updated 11 months ago
- Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning☆15Updated 3 years ago
- Rewarded soups official implementation☆60Updated 2 years ago
- ☆32Updated last year
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆32Updated last year
- Code for Contrastive Preference Learning (CPL)☆176Updated 11 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆91Updated last year
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆196Updated 6 months ago
- ☆24Updated 2 years ago
- Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —☆243Updated last month
- Paper collections of the continuous effort start from World Models.☆186Updated last year
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆379Updated last year
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆76Updated last year
- maze datasets for investigating OOD behavior of ML systems☆64Updated last week
- Official repository for Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty☆38Updated 2 months ago
- A list of papers regarding generalization in (deep) reinforcement learning☆152Updated 2 years ago
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆19Updated 3 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆164Updated 2 years ago
- ☆23Updated 4 months ago
- [TNNLS-2024, arXiv-2023.2.10] Official repository of "A Survey on Causal Reinforcement Learning"☆52Updated 2 months ago
- Code for Invariant Policy Optimization☆12Updated 5 years ago
- Source code of the ICML24 paper "Self-Composing Policies for Scalable Continual Reinforcement Learning" (selected for oral presentation)☆23Updated last year
- ☆51Updated 2 years ago
- A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)☆189Updated 2 months ago
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML…☆43Updated 4 years ago