WEIRDLabUW / vpl_llmLinks
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆18Updated 9 months ago
Alternatives and similar repositories for vpl_llm
Users that are interested in vpl_llm are comparing it to the libraries listed below
Sorting:
- Reinforcement Learning via Regressing Relative Rewards☆33Updated 5 months ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆29Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 8 months ago
- Official PyTorch implementation of NPwSA: "Neural Processes with Stochastic Attention: Paying more attention to the context dataset (ICLR…☆10Updated 2 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.☆43Updated 4 months ago
- [CLeaR23] Causal Triplet: An Open Challenge for Intervention-centric Causal Representation Learning☆30Updated 2 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Updated last year
- ☆15Updated 2 years ago
- ☆12Updated 2 years ago
- What Makes a Reward Model a Good Teacher? An Optimization Perspective☆31Updated last month
- ☆16Updated last year
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"☆34Updated 2 years ago
- ☆23Updated 8 months ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Updated last year
- ☆17Updated last year
- ☆27Updated last year
- Distributional and Outlier Robust Optimization (ICML 2021)☆27Updated 3 years ago
- Code repository of the paper "CITRIS: Causal Identifiability from Temporal Intervened Sequences" and "iCITRIS: Causal Representation Lear…☆52Updated last year
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated 10 months ago
- Rewarded soups official implementation☆58Updated last year
- Accompanies the EMNLP 2024 paper: "Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions". This repo featur…☆19Updated 4 months ago
- Self-Supervised Alignment with Mutual Information☆19Updated last year
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- Official Implementation of the paper: "A Rate-Distorion View of Uncertainty Quantification", ICML 2024☆28Updated 9 months ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 4 years ago
- ☆29Updated last year
- Official PyTorch implementation of "Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data" (NeurIPS'23)☆15Updated last year
- Generalised UDRL☆37Updated 3 years ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆9Updated 4 months ago