allenai / contrastive-explanations
Explaining neural decisions contrastively to alternative decisions.
☆24Updated 4 years ago
Alternatives and similar repositories for contrastive-explanations:
Users who are interested in contrastive-explanations are comparing it to the libraries listed below.
- Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?"☆21Updated 4 years ago
- ☆44Updated last year
- Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"☆17Updated 2 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Updated 4 years ago
- Codebase for running (conditional) probing experiments☆23Updated 2 years ago
- ☆25Updated 3 years ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆13Updated 2 years ago
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"☆19Updated 2 years ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Updated last year
- ☆24Updated 3 years ago
- ☆15Updated 3 years ago
- ☆18Updated 3 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆37Updated 2 years ago
- ☆26Updated 2 years ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆19Updated last week
- Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)☆27Updated 2 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆19Updated last year
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers"☆17Updated 4 years ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"☆27Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- ☆10Updated 2 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆53Updated last year
- Group-conditional DRO to alleviate spurious correlations☆15Updated 3 years ago
- In-context Example Selection with Influences☆15Updated last year
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"☆31Updated last year
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆15Updated 11 months ago
- Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based…☆15Updated 2 years ago
- [ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models☆60Updated 2 years ago
- ☆22Updated 2 years ago