tommccoy1 / hans
Heuristic Analysis for NLI Systems
☆125Updated 4 years ago
Alternatives and similar repositories for hans
Users that are interested in hans are comparing it to the libraries listed below
Sorting:
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/☆96Updated 2 years ago
- ☆59Updated last year
- ☆139Updated 4 years ago
- A BART version of an open-domain QA model in a closed-book setup☆119Updated 4 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆82Updated 4 years ago
- ☆229Updated 4 years ago
- EMNLP DiscoEval paper☆43Updated 5 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization☆71Updated 2 years ago
- ☆46Updated 5 years ago
- ☆163Updated 3 years ago
- Various utility scripts useful for natural language processing, machine translation, etc.☆49Updated 2 years ago
- ☆42Updated 4 years ago
- Codebase for the Summary Loop paper at ACL2020☆44Updated last year
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆143Updated 2 years ago
- Diagnostic tests for linguistic capacities in language models☆66Updated 3 years ago
- NLI test set with lexical inferences☆49Updated 6 years ago
- ☆27Updated 2 years ago
- This is the Grammarly's Yahoo Answers Formality Corpus☆105Updated last year
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.☆36Updated 4 years ago
- Code for papers "A Surprisingly Robust Trick for Winograd Schema Challenge" and "WikiCREM: A Large Unsupervised Corpus for Coreference Re…☆71Updated 2 years ago
- Neural Module Network for Reasoning over Text, ICLR 2020☆120Updated 4 years ago
- ☆63Updated 5 years ago
- Cross-Lingual Alignment of Contextual Word Embeddings☆99Updated 5 years ago
- ACL 2020 Tutorial by Malihe Alikhani and Matthew Stone☆37Updated 4 years ago
- Tutorials on training and testing retrieval-based models (DrQA & DPR)☆51Updated 4 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework☆52Updated 5 years ago
- Hyperparameter Search for AllenNLP☆139Updated 2 months ago
- A reference-free metric for measuring summary quality, learned from human ratings.☆43Updated 2 years ago
- syntactically controlled paraphrase networks☆167Updated 6 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"☆66Updated 3 years ago