peterbhase / ExplanationSearch
Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"
☆17Updated 2 years ago
Alternatives and similar repositories for ExplanationSearch:
Users that are interested in ExplanationSearch are comparing it to the libraries listed below
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"☆19Updated 2 years ago
- ☆24Updated 3 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Updated 4 years ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Updated last year
- Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?"☆20Updated 4 years ago
- Explaining neural decisions contrastively to alternative decisions.☆24Updated 3 years ago
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers"☆17Updated 4 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated last year
- ☆11Updated 2 years ago
- Codebase for running (conditional) probing experiments☆23Updated 2 years ago
- In-context Example Selection with Influences☆15Updated last year
- A Diagnostic Study of Explainability Techniques for Text Classification☆66Updated 4 years ago
- ☆87Updated last year
- Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)☆25Updated 2 years ago
- ☆38Updated last year
- ☆27Updated last year
- ☆10Updated 2 years ago
- ☆13Updated last year
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"☆31Updated last year
- Rationales for Sequential Predictions☆40Updated 2 years ago
- NAACL 2022: Can Rationalization Improve Robustness? https://arxiv.org/abs/2204.11790☆27Updated 2 years ago
- A unified approach to explain conditional text generation models. Pytorch. The code of paper "Local Explanation of Dialogue Response Gene…☆17Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated last year
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆13Updated 2 years ago
- ☆64Updated 4 years ago
- ☆26Updated 2 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆53Updated last year
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆18Updated 3 years ago
- ☆44Updated last year
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆37Updated 2 years ago