WEIRDLabUW / vpl_llmLinks
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆20Updated 10 months ago
Alternatives and similar repositories for vpl_llm
Users that are interested in vpl_llm are comparing it to the libraries listed below
Sorting:
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆30Updated last year
- Generalised UDRL☆37Updated 3 years ago
- Reinforcement Learning via Regressing Relative Rewards☆34Updated 7 months ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆10Updated last month
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 10 months ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated 2 years ago
- Code repository complementing the ICLR 2021 paper "Unsupervised Object Keypoint Learning using Local Spatial Predictability" (https://arx…☆9Updated 6 months ago
- ☆17Updated last year
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆16Updated last year
- Repo to reproduce the First-Explore paper results☆37Updated 6 months ago
- Code repository of the paper "CITRIS: Causal Identifiability from Temporal Intervened Sequences" and "iCITRIS: Causal Representation Lear…☆53Updated 2 years ago
- This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data".☆25Updated last year
- ☆32Updated 11 months ago
- We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effe…☆24Updated last year
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆27Updated last year
- A PyTorch Implementation of Skipper☆28Updated 9 months ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)☆28Updated last year
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated last year
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 4 years ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆33Updated 11 months ago
- PyTorch Package For Quasimetric Learning☆42Updated 8 months ago
- INTeractive learning via REPresentatIon Discovery☆34Updated last year
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"☆34Updated 2 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆18Updated 4 years ago
- Code for Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning☆12Updated 5 months ago
- Code for the paper Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration☆32Updated 4 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆54Updated 3 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated last year
- Variational Reinforcement Learning☆16Updated 11 months ago