WEIRDLabUW / vpl_llm
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆16 · Updated 7 months ago
Alternatives and similar repositories for vpl_llm:
Users who are interested in vpl_llm are comparing it to the libraries listed below.
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆29 · Updated last year
- ☆31 · Updated 3 months ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆28 · Updated 2 years ago
- Code repository complementing the ICLR 2021 paper "Unsupervised Object Keypoint Learning using Local Spatial Predictability" (https://arx… ☆9 · Updated 2 months ago
- Generalised UDRL ☆37 · Updated 2 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 · Updated 6 months ago
- ☆16 · Updated last year
- Code for Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning ☆11 · Updated last month
- Representation Learning in RL ☆16 · Updated 2 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling ☆16 · Updated 3 years ago
- Official implementation of the paper "Interventions, Where and How? Experimental Design for Causal Models at Scale", NeurIPS 2022. ☆19 · Updated 2 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693) ☆18 · Updated last year
- ☆37 · Updated 8 months ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020. ☆14 · Updated 8 months ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories ☆42 · Updated last year
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" … ☆16 · Updated 9 months ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf… ☆11 · Updated last year
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts" ☆25 · Updated 3 years ago
- PyTorch Package For Quasimetric Learning ☆41 · Updated 5 months ago
- Variational Reinforcement Learning ☆16 · Updated 8 months ago
- Code for Dataset and Benchmarks Submission, Neurips 2022 ☆13 · Updated 2 years ago
- [CLeaR23] Causal Triplet: An Open Challenge for Intervention-centric Causal Representation Learning ☆30 · Updated last year
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning ☆48 · Updated 3 years ago
- Official PyTorch implementation of NPwSA: "Neural Processes with Stochastic Attention: Paying more attention to the context dataset (ICLR… ☆10 · Updated 2 years ago
- ☆24 · Updated last year
- Standardized Minecraft Diamond Environment for Reinforcement Learning ☆26 · Updated last year
- ☆11 · Updated 2 years ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt… ☆28 · Updated 7 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data) ☆28 · Updated last year
- Code repository of the paper "CITRIS: Causal Identifiability from Temporal Intervened Sequences" and "iCITRIS: Causal Representation Lear… ☆51 · Updated last year