kailas-v / human-ai-interactions
☆11Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for human-ai-interactions
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Updated 3 years ago
- ☆10Updated 2 years ago
- The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We…☆46Updated last year
- Compositional Explanations of Neurons, NeurIPS 2020 https://arxiv.org/abs/2006.14032☆25Updated 3 years ago
- ☆15Updated 9 months ago
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆13Updated this week
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆19Updated 5 months ago
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆12Updated 3 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆26Updated last year
- Extending Conformal Prediction to LLMs☆58Updated 5 months ago
- A Python Data Valuation Package☆28Updated last year
- ☆32Updated last year
- Jiminy Cricket Environment (NeurIPS 2021)☆24Updated 2 years ago
- Experiments with experimental rule-based models to go along with imodels.☆15Updated 2 weeks ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆98Updated 5 months ago
- Rewarded soups official implementation☆51Updated last year
- Explaining neural decisions contrastively to alternative decisions.☆23Updated 3 years ago
- Tools for robustness evaluation in interpretability methods☆11Updated 3 years ago
- ☆43Updated 2 years ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆40Updated 2 years ago
- Code and webpages for our study on teaching humans to defer to an AI☆11Updated last year
- Code for "Generative causal explanations of black-box classifiers"☆33Updated 3 years ago
- Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)☆42Updated 4 years ago
- Group-conditional DRO to alleviate spurious correlations☆15Updated 3 years ago
- ☆31Updated 2 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆27Updated 11 months ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆9Updated 8 months ago
- Code for paper: Are Large Language Models Post Hoc Explainers?☆27Updated 4 months ago
- ☆19Updated 4 years ago
- ☆18Updated last month