kailas-v / human-ai-interactions
☆11Updated 2 years ago
Alternatives and similar repositories for human-ai-interactions:
Users who are interested in human-ai-interactions are comparing it to the libraries listed below
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆12Updated 3 years ago
- Post-processing for fair classification☆12Updated 2 months ago
- The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We…☆46Updated last year
- ☆58Updated 3 years ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"☆27Updated last year
- Code for Residual Energy-Based Models for Text Generation in PyTorch.☆23Updated 3 years ago
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch.☆48Updated last year
- Group-conditional DRO to alleviate spurious correlations☆15Updated 3 years ago
- In-context Example Selection with Influences☆15Updated last year
- ☆27Updated 6 months ago
- Tools for robustness evaluation in interpretability methods☆10Updated 3 years ago
- Compositional Explanations of Neurons, NeurIPS 2020 https://arxiv.org/abs/2006.14032☆25Updated 3 years ago
- Explaining neural decisions contrastively to alternative decisions.☆23Updated 3 years ago
- Solving the causality pairs challenge (does A cause B) with ChatGPT☆75Updated 7 months ago
- The code for the paper *The Sensitivity of Counterfactual Fairness to Unmeasured Confounding* @ UAI 2019☆12Updated 4 years ago
- ☆86Updated last year
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)☆13Updated 2 years ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆9Updated 10 months ago
- ☆34Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs?☆26Updated 7 months ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Updated 3 years ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆13Updated 2 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated last year
- A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.☆34Updated this week
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆15Updated 2 years ago
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Updated 3 years ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- Toy datasets to evaluate algorithms for domain generalization and invariance learning.☆42Updated 3 years ago
- Jiminy Cricket Environment (NeurIPS 2021)☆24Updated 2 years ago
- Code for "Generative causal explanations of black-box classifiers"☆33Updated 4 years ago