danishpruthi / deceptive-attentionLinks

Code & Data for the paper "Learning to Deceive with Attention-Based Explanations"

☆18

Alternatives and similar repositories for deceptive-attention

Users that are interested in deceptive-attention are comparing it to the libraries listed below

Sorting:

xhan77 / influence-function-analysis
☆63Updated 5 years ago
Eric-Wallace / interpretability-tutorial-emnlp2020
Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"
☆199Updated 4 years ago
nicola-decao / diffmask
Pytorch implementation of DiffMask
☆57Updated 2 years ago
OanaMariaCamburu / e-SNLI
☆164Updated 3 years ago
shauli-ravfogel / nullspace_projection
☆89Updated 3 years ago
bastings / interpretable_predictions
Interpretable Neural Predictions with Differentiable Binary Variables
☆84Updated 4 years ago
john-hewitt / control-tasks
Repository describing example random control tasks for designing and interpreting neural probes
☆32Updated 3 years ago
lena-voita / description-length-probing
This is a repository with the code for the EMNLP 2020 paper "Information-Theoretic Probing with Minimum Description Length"
☆71Updated 11 months ago
sarahwie / attention
Code for EMNLP 2019 paper "Attention is not not Explanation"
☆58Updated 4 years ago
ShilinHe / interpretableNLP
A list of publications on NLP interpretability (Welcome PR)
☆168Updated 4 years ago
acmi-lab / counterfactually-augmented-data
Learning the Difference that Makes a Difference with Counterfactually-Augmented Data
☆170Updated 4 years ago
copenlu / X-MAML
Code base for paper "Zero-Shot Cross-Lingual Transfer with Meta Learning"
☆34Updated 8 months ago
shentianxiao / text-autoencoders
☆209Updated last year
tommccoy1 / hans
Heuristic Analysis for NLI Systems
☆126Updated 4 years ago
bohanli / vae-pretraining-encoder
PyTorch implementation of A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text (EMNLP 2019)
☆48Updated 5 years ago
jayded / eraserbenchmark
A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/
☆96Updated 2 years ago
CogComp / perspectrum
Perspectrum: a dataset of claims, perspectives and evidence documents
☆34Updated 5 years ago
nitishgupta / nmn-drop
Neural Module Network for Reasoning over Text, ICLR 2020
☆120Updated 4 years ago
rpryzant / delete_retrieve_generate
PyTorch implementation of the Delete, Retrieve Generate style transfer algorithm
☆132Updated last year
rtmdrr / testSignificanceNLP
☆230Updated 4 years ago
aetting / lm-diagnostics
Diagnostic tests for linguistic capacities in language models
☆66Updated 3 years ago
bhargaviparanjape / explainable_qa
Implementation for https://arxiv.org/abs/2005.00652
☆28Updated 2 years ago
pmichel31415 / teapot-nlp
Tool for Evaluating Adversarial Perturbations on Text
☆61Updated 3 years ago
raosudha89 / GYAFC-corpus
This is the Grammarly's Yahoo Answers Formality Corpus
☆107Updated 2 weeks ago
boknilev / nlp-analysis-methods
Companion site for "Analysis Methods in Neural Language Processing: A Survey"
☆66Updated 5 years ago
SawanKumar28 / nile
NILE : Natural Language Inference with Faithful Natural Language Explanations
☆30Updated 2 years ago
CannyLab / summary_loop
Codebase for the Summary Loop paper at ACL2020
☆46Updated 2 years ago
asappresearch / rationale-alignment
☆46Updated 5 years ago
copenlu / xai-benchmark
A Diagnostic Study of Explainability Techniques for Text Classification
☆68Updated 4 years ago
BrendanKennedy / contextualizing-hate-speech-models-with-explanations
Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"
☆35Updated 3 years ago