WEIRDLabUW / vpl_llmLinks
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆19Updated 11 months ago
Alternatives and similar repositories for vpl_llm
Users that are interested in vpl_llm are comparing it to the libraries listed below
Sorting:
- A list of papers regarding generalization in (deep) reinforcement learning☆151Updated 2 years ago
- Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning☆15Updated 3 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆30Updated last year
- Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning (ICML 2024)☆16Updated last year
- ☆20Updated 3 years ago
- Object Centric Atari games☆86Updated 3 weeks ago
- ☆50Updated 2 years ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆70Updated last year
- Code for Contrastive Preference Learning (CPL)☆174Updated 8 months ago
- Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext…☆123Updated 2 years ago
- Official code repository for Prompt-DT.☆114Updated 3 years ago
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"☆34Updated 2 years ago
- ☆100Updated last year
- ☆20Updated last year
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Rewarded soups official implementation☆58Updated last year
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML…☆40Updated 4 years ago
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆72Updated 2 months ago
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti…☆44Updated 3 years ago
- A curated list of causal reinforcement learning resources.☆96Updated last year
- Code for the paper "Learning Options via Compression" at NeurIPS 2022☆24Updated 2 years ago
- Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.☆24Updated 2 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning☆48Updated 4 years ago
- A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.☆45Updated 6 months ago
- ☆16Updated 9 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆163Updated last year
- ☆34Updated 2 years ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆47Updated 2 years ago
- Clean, extensible implementation of MACAW [ICML 2021]☆12Updated 3 years ago
- Source code of the ICML24 paper "Self-Composing Policies for Scalable Continual Reinforcement Learning" (selected for oral presentation)☆23Updated last year