WEIRDLabUW / vpl_llmLinks
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆22Updated last year
Alternatives and similar repositories for vpl_llm
Users that are interested in vpl_llm are comparing it to the libraries listed below
Sorting:
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆78Updated 5 months ago
- Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning (ICML 2024)☆18Updated last year
- A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.☆45Updated 10 months ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆32Updated last year
- ☆33Updated last year
- Direct preference optimization with f-divergences.☆15Updated last year
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆77Updated last year
- Code for Contrastive Preference Learning (CPL)☆176Updated 11 months ago
- Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning☆15Updated 3 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Rewarded soups official implementation☆62Updated 2 years ago
- Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —☆253Updated 2 months ago
- Source code of the ICML24 paper "Self-Composing Policies for Scalable Continual Reinforcement Learning" (selected for oral presentation)☆24Updated last year
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆92Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆164Updated 2 years ago
- Paper collections of the continuous effort start from World Models.☆188Updated last year
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆197Updated 7 months ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆47Updated 2 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆153Updated 2 years ago
- Code for "Goal-Conditioned Predictive Coding for Offline Reinforcement Learning" (NeurIPS 2023)☆12Updated last year
- ☆105Updated last year
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"☆34Updated 2 years ago
- Official code repository for Prompt-DT.☆117Updated 3 years ago
- ☆51Updated 3 years ago
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆381Updated last year
- A curated list of causal reinforcement learning resources.☆105Updated last year
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML…☆44Updated 4 years ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆92Updated last year
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆129Updated last year
- ☆16Updated last year