tommccoy1 / hans
Heuristic Analysis for NLI Systems
☆125 Updated 4 years ago
Alternatives and similar repositories for hans:
Users who are interested in hans are comparing it to the repositories listed below.
- ☆59 Updated last year
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h… ☆81 Updated 4 years ago
- ☆229 Updated 4 years ago
- Hyperparameter Search for AllenNLP ☆137 Updated 3 weeks ago
- Various utility scripts useful for natural language processing, machine translation, etc. ☆48 Updated 2 years ago
- EMNLP DiscoEval paper ☆43 Updated 5 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/ ☆96 Updated 2 years ago
- Code for papers "A Surprisingly Robust Trick for Winograd Schema Challenge" and "WikiCREM: A Large Unsupervised Corpus for Coreference Re… ☆71 Updated 2 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization ☆71 Updated 2 years ago
- Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs" ☆77 Updated 3 years ago
- ☆46 Updated 5 years ago
- NLI test set with lexical inferences ☆49 Updated 6 years ago
- A BART version of an open-domain QA model in a closed-book setup ☆119 Updated 4 years ago
- ☆163 Updated 2 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets" ☆66 Updated 3 years ago
- Codebase for the Summary Loop paper at ACL2020 ☆44 Updated last year
- Lexically constrained decoding for sequence generation using Grid Beam Search ☆91 Updated 6 years ago
- This is a repository with the code for the EMNLP 2020 paper "Information-Theoretic Probing with Minimum Description Length" ☆69 Updated 7 months ago
- Neural Module Network for Reasoning over Text, ICLR 2020 ☆120 Updated 4 years ago
- ☆138 Updated 3 years ago
- Diagnostic tests for linguistic capacities in language models ☆66 Updated 2 years ago
- The Benchmark of Linguistic Minimal Pairs ☆149 Updated 2 years ago
- ☆42 Updated 4 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization. ☆142 Updated 2 years ago
- ☆64 Updated 4 years ago
- ☆58 Updated 2 years ago
- Assessing syntactic abilities of BERT ☆148 Updated 5 years ago
- Tutorials on training and testing retrieval-based models (DrQA & DPR) ☆51 Updated 4 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets. ☆36 Updated 4 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation" ☆152 Updated 2 years ago