WEIRDLabUW / vpl_llm
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆16 Updated 8 months ago
Alternatives and similar repositories for vpl_llm:
Users who are interested in vpl_llm are comparing it to the repositories listed below
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆29 Updated last year
- ☆31 Updated 4 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 Updated 7 months ago
- ☆17 Updated last year
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆28 Updated 2 years ago
- Code repository complementing the ICLR 2021 paper "Unsupervised Object Keypoint Learning using Local Spatial Predictability" (https://arx… ☆9 Updated 3 months ago
- ☆23 Updated 7 months ago
- Official code of the paper "BISCUIT: Causal Representation Learning from Binary Interactions" (UAI 2023) ☆32 Updated last year
- PyTorch Package For Quasimetric Learning ☆41 Updated 5 months ago
- Code repository of the paper "CITRIS: Causal Identifiability from Temporal Intervened Sequences" and "iCITRIS: Causal Representation Lear… ☆51 Updated last year
- Code for Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning ☆11 Updated 2 months ago
- [CLeaR23] Causal Triplet: An Open Challenge for Intervention-centric Causal Representation Learning ☆30 Updated 2 years ago
- Generalised UDRL ☆37 Updated 2 years ago
- Structured Neural Networks ☆14 Updated 11 months ago
- A testbed for agents and environments that can automatically improve models through data generation. ☆23 Updated last month
- ☆15 Updated 5 months ago
- Code to reproduce the experimental results from the paper "Active Invariant Causal Prediction: Experiment Selection Through Stability", b… ☆20 Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories ☆42 Updated last year
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆26 Updated last year
- ☆24 Updated last year
- This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data". ☆22 Updated last year
- ☆15 Updated 2 years ago
- Efficient Scaling laws and collaborative pretraining. ☆16 Updated 3 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025] ☆18 Updated last month
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts" ☆25 Updated 3 years ago
- Official PyTorch implementation of NPwSA: "Neural Processes with Stochastic Attention: Paying more attention to the context dataset (ICLR… ☆10 Updated 2 years ago
- ☆37 Updated 8 months ago
- Code for Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding ☆22 Updated 2 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning ☆48 Updated 3 years ago
- Implementations of growing and pruning in neural networks ☆22 Updated last year