peterbhase / ExplanationSearch

Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"

☆18

Alternatives and similar repositories for ExplanationSearch

Users who are interested in ExplanationSearch are comparing it to the repositories listed below.

CoderPat / learning-scaffold
This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"
☆19 Updated 3 years ago
salesforce / fast-influence-functions
☆89 Updated 3 months ago
hsajjad / Interpretability-Tutorial-NAACL2021
☆24 Updated 4 years ago
peterbhase / ExplanationRoles
Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"
☆14 Updated 4 years ago
ruiqi-zhong / DescribeDistributionalDifferences
Code for preprint: Summarizing Differences between Text Distributions with Natural Language
☆42 Updated 2 years ago
technion-cs-nlp / irm-for-nli
☆11 Updated 3 years ago
camelop / NLP-Robustness
OOD Generalization and Detection (ACL 2020)
☆60 Updated 5 years ago
copenlu / xai-benchmark
A Diagnostic Study of Explainability Techniques for Text Classification
☆68 Updated 4 years ago
INK-USC / DIG
Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)
☆27 Updated 3 years ago
mourga / contrastive-active-learning
Code for the EMNLP 2021 Paper "Active Learning by Acquiring Contrastive Examples" & the ACL 2022 Paper "On the Importance of Effectively …
☆126 Updated 3 years ago
easonnie / ChaosNLI
[EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data
☆38 Updated 3 years ago
allenai / contrastive-explanations
Explaining neural decisions contrastively to alternative decisions.
☆25 Updated 4 years ago
yanaiela / pararel
☆45 Updated last year
shreydesai / calibration
Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers"
☆61 Updated 2 years ago
AI-secure / InfoBERT
[ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Y…
☆85 Updated last year
archiki / GrIPS
Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"
☆55 Updated 2 years ago
technion-cs-nlp / bias-probing
Debiasing Methods in Natural Language Understanding Make Bias More Accessible: Code and Data
☆14 Updated 3 years ago
awebson / prompt_semantics
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
☆85 Updated 3 years ago
keyonvafa / sequential-rationales
Rationales for Sequential Predictions
☆40 Updated 3 years ago
awasthiabhijeet / Learning-From-Rules
Implementation of experiments in paper "Learning from Rules Generalizing Labeled Exemplars" to appear in ICLR2020 (https://openreview.net…
☆50 Updated 2 years ago
alexandra-chron / hierarchical-domain-adaptation
Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.
☆32 Updated last year
peterbhase / LAS-NL-Explanations
Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?"
☆22 Updated 4 years ago
allenai / mice
☆26 Updated 2 years ago
princeton-nlp / rationale-robustness
NAACL 2022: Can Rationalization Improve Robustness? https://arxiv.org/abs/2204.11790
☆27 Updated 2 years ago
jzbjyb / lm-calibration
☆35 Updated 3 years ago
successar / FRESH
☆27 Updated 2 years ago
acmi-lab / counterfactually-augmented-data
Learning the Difference that Makes a Difference with Counterfactually-Augmented Data
☆170 Updated 4 years ago
INK-USC / expl-refinement
Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)
☆12 Updated 3 years ago
pliang279 / LM_bias
[ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models
☆61 Updated 2 years ago
shauli-ravfogel / nullspace_projection
☆89 Updated 3 years ago