kailas-v / human-ai-interactionsLinks
☆12Updated 2 years ago
Alternatives and similar repositories for human-ai-interactions
Users that are interested in human-ai-interactions are comparing it to the libraries listed below
Sorting:
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆13Updated 3 years ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Updated last year
- csl: PyTorch-based Constrained Learning☆12Updated 3 years ago
- Post-processing for fair classification☆16Updated 3 months ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆10Updated last year
- ☆10Updated 3 years ago
- ☆32Updated last year
- ☆14Updated 2 years ago
- Group-conditional DRO to alleviate spurious correlations☆15Updated 4 years ago
- ☆34Updated 2 years ago
- Extending Conformal Prediction to LLMs☆67Updated last year
- The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We…☆46Updated 2 years ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆27Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated 2 years ago
- Code for Residual Energy-Based Models for Text Generation in PyTorch.☆25Updated 4 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆13Updated last year
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆25Updated 11 months ago
- Compositional Explanations of Neurons, NeurIPS 2020 https://arxiv.org/abs/2006.14032☆25Updated 4 years ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆24Updated last year
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"☆19Updated 3 years ago
- Explanation Optimization☆13Updated 4 years ago
- Teaching Models to Express Their Uncertainty in Words☆39Updated 3 years ago
- Tools for robustness evaluation in interpretability methods☆10Updated 4 years ago
- ☆100Updated last year
- ☆38Updated last year
- ROCK Framework for Commonsense Causality Reasoning (CCR)☆10Updated 2 years ago
- The code to reproduce CVPR 2021 paper "Towards Robust Classification Model by Counterfactual and Invariant Data Generation"☆17Updated 4 years ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆17Updated 3 years ago
- Implementation of paper "Probabilistic Active Meta-Learning" (NeurIPS 2020).☆20Updated 4 years ago
- Explaining neural decisions contrastively to alternative decisions.☆25Updated 4 years ago