tommccoy1 / hans
Heuristic Analysis for NLI Systems
☆125 Updated 4 years ago
Alternatives and similar repositories for hans:
Users interested in hans are comparing it to the repositories listed below.
- ☆58 Updated last year
- ☆228 Updated 3 years ago
- ☆138 Updated 3 years ago
- Hyperparameter Search for AllenNLP ☆135 Updated last month
- NLI test set with lexical inferences ☆48 Updated 6 years ago
- Neural Module Network for Reasoning over Text, ICLR 2020 ☆120 Updated 4 years ago
- ☆163 Updated 2 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/ ☆96 Updated 2 years ago
- Diagnostic tests for linguistic capacities in language models ☆66 Updated 2 years ago
- ☆46 Updated 5 years ago
- A BART version of an open-domain QA model in a closed-book setup ☆119 Updated 4 years ago
- Evaluating recurrent neural networks on predicting subject-verb agreement dependencies ☆62 Updated 2 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h… ☆81 Updated 4 years ago
- ☆39 Updated 3 years ago
- code and data for EMNLP-19 paper "Counterfactual Story Reasoning and Generation" https://arxiv.org/abs/1909.04076 ☆101 Updated 4 years ago
- EMNLP DiscoEval paper ☆42 Updated 5 years ago
- Various utility scripts useful for natural language processing, machine translation, etc. ☆48 Updated 2 years ago
- Assessing syntactic abilities of BERT ☆148 Updated 5 years ago
- Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs" ☆77 Updated 3 years ago
- This is Grammarly's Yahoo Answers Formality Corpus ☆106 Updated last year
- REALSumm: Re-evaluating Evaluation in Text Summarization ☆71 Updated 2 years ago
- Codebase for the Summary Loop paper at ACL2020 ☆44 Updated last year
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets" ☆66 Updated 3 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework ☆52 Updated 5 years ago
- This repository contains the script to compute the questions based on the Answerability aspect. ☆38 Updated 5 years ago
- Evaluate your dialog model with 17 metrics! (see paper) ☆97 Updated 4 years ago
- Dataset and code for “Going on a vacation” takes longer than “Going for a walk”: A Study of Temporal Commonsense Understanding, EMNLP 201… ☆39 Updated 4 years ago
- Code for papers "A Surprisingly Robust Trick for Winograd Schema Challenge" and "WikiCREM: A Large Unsupervised Corpus for Coreference Re… ☆71 Updated 2 years ago
- The Benchmark of Linguistic Minimal Pairs ☆148 Updated 2 years ago
- Cross-Lingual Alignment of Contextual Word Embeddings ☆99 Updated 5 years ago