A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/
☆101 · Nov 11, 2022 · Updated 3 years ago
Alternatives and similar repositories for eraserbenchmark
Users interested in eraserbenchmark are comparing it to the libraries listed below.
- Baseline for ERASER benchmark ☆17 · Dec 18, 2022 · Updated 3 years ago
- NILE : Natural Language Inference with Faithful Natural Language Explanations ☆30 · Jun 12, 2023 · Updated 2 years ago
- ☆27 · Jun 12, 2023 · Updated 2 years ago
- Implementation for https://arxiv.org/abs/2005.00652 ☆28 · Dec 8, 2022 · Updated 3 years ago
- Explaining neural decisions contrastively to alternative decisions. ☆25 · Mar 18, 2021 · Updated 4 years ago
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers" ☆18 · Dec 15, 2020 · Updated 5 years ago
- ☆166 · Apr 29, 2022 · Updated 3 years ago
- Commonsense Explanations Dataset and Code ☆148 · Jun 16, 2025 · Updated 8 months ago
- AAAI 2022 paper - Unifying Model Explainability and Robustness for Joint Text Classification and Rationale Extraction ☆18 · Dec 23, 2021 · Updated 4 years ago
- A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and T… ☆210 · Oct 20, 2021 · Updated 4 years ago
- ☆13 · Jul 26, 2023 · Updated 2 years ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations" ☆31 · Apr 28, 2023 · Updated 2 years ago
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching" ☆20 · May 19, 2022 · Updated 3 years ago
- Interpretable Neural Predictions with Differentiable Binary Variables ☆85 · May 7, 2021 · Updated 4 years ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales" ☆12 · Sep 12, 2023 · Updated 2 years ago
- ☆12 · Nov 15, 2022 · Updated 3 years ago
- ☆20 · Oct 12, 2021 · Updated 4 years ago
- Learning the Difference that Makes a Difference with Counterfactually-Augmented Data ☆171 · Apr 26, 2021 · Updated 4 years ago
- ☆26 · Apr 15, 2021 · Updated 4 years ago
- Pytorch implementation of DiffMask ☆58 · Jun 12, 2023 · Updated 2 years ago
- Resources for the MRQA 2019 Shared Task ☆294 · Aug 5, 2021 · Updated 4 years ago
- ☆24 · May 22, 2023 · Updated 2 years ago
- Official Code Repo for the Paper: "How does This Interaction Affect Me? Interpretable Attribution for Feature Interactions", In NeurIPS 2… ☆42 · Oct 31, 2022 · Updated 3 years ago
- This data release is meant to accompany and document the paper: https://arxiv.org/abs/2004.11997 Collecting Entailment Data for Pretrain… ☆14 · Sep 29, 2020 · Updated 5 years ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases" ☆18 · Sep 9, 2022 · Updated 3 years ago
- Method for evaluating system summaries manually, via crowdsourcing, using a summarization dataset that includes reference summaries. ☆12 · May 5, 2019 · Updated 6 years ago
- "Deriving Machine Attention from Human Rationales" EMNLP 2018 ☆27 · Feb 15, 2019 · Updated 7 years ago
- Variational Methods for Pretraining in Resource-limited Environments ☆174 · Jul 29, 2020 · Updated 5 years ago
- ☆64 · Apr 25, 2020 · Updated 5 years ago
- Code and dataset "ZEST" from "Learning from task descriptions", Weller et al, EMNLP 2020 ☆17 · Mar 15, 2021 · Updated 4 years ago
- r4c ☆14 · Mar 2, 2021 · Updated 4 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi… ☆49 · Apr 26, 2021 · Updated 4 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals. ☆18 · Apr 25, 2021 · Updated 4 years ago
- ☆49 · Aug 22, 2018 · Updated 7 years ago
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts? ☆16 · Aug 8, 2023 · Updated 2 years ago
- ☆49 · Jun 12, 2023 · Updated 2 years ago
- Tensorflow implementation of Invariant Rationalization ☆50 · Feb 16, 2023 · Updated 3 years ago
- ☆39 · Apr 29, 2023 · Updated 2 years ago
- Hyperparameter Search for AllenNLP ☆140 · Mar 6, 2025 · Updated 11 months ago