peterbhase / ExplanationSearch
Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"
☆18 Updated 2 years ago
Alternatives and similar repositories for ExplanationSearch
Users who are interested in ExplanationSearch are comparing it to the libraries listed below.
- ☆24 Updated 4 years ago
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching" ☆19 Updated 3 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data" ☆14 Updated 4 years ago
- ☆11 Updated 3 years ago
- Explaining neural decisions contrastively to alternative decisions. ☆25 Updated 4 years ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales" ☆12 Updated last year
- Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?" ☆22 Updated 4 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language ☆42 Updated 2 years ago
- Codebase for running (conditional) probing experiments ☆22 Updated 2 years ago
- In-context Example Selection with Influences ☆15 Updated 2 years ago
- ☆10 Updated 2 years ago
- ☆25 Updated 3 years ago
- ☆13 Updated last year
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations" ☆31 Updated 2 years ago
- Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021) ☆27 Updated 3 years ago
- ☆26 Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆38 Updated 2 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling ☆31 Updated 3 years ago
- Debiasing Methods in Natural Language Understanding Make Bias More Accessible: Code and Data ☆14 Updated 3 years ago
- ☆39 Updated 4 years ago
- ☆89 Updated last month
- NAACL 2022: Can Rationalization Improve Robustness? https://arxiv.org/abs/2204.11790 ☆27 Updated 2 years ago
- ☆89 Updated 3 years ago
- A Diagnostic Study of Explainability Techniques for Text Classification ☆67 Updated 4 years ago
- Group-conditional DRO to alleviate spurious correlations ☆15 Updated 3 years ago
- PyTorch implementation of DiffMask ☆56 Updated 2 years ago
- CausaLM: Causal Model Explanation Through Counterfactual Language Models ☆55 Updated 5 years ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale" ☆27 Updated 2 years ago
- ☆31 Updated last year
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts" ☆13 Updated 3 years ago