aetting / lm-diagnosticsLinks
Diagnostic tests for linguistic capacities in language models
☆66Updated 3 years ago
Alternatives and similar repositories for lm-diagnostics
Users that are interested in lm-diagnostics are comparing it to the libraries listed below
Sorting:
- ☆230Updated 4 years ago
- Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"☆199Updated 4 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/☆96Updated 2 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆144Updated 2 years ago
- The Benchmark of Linguistic Minimal Pairs☆152Updated 2 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆82Updated 4 years ago
- ☆59Updated 2 years ago
- Heuristic Analysis for NLI Systems☆126Updated 4 years ago
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann…☆29Updated 5 years ago
- Perspectrum: a dataset of claims, perspectives and evidence documents☆34Updated 5 years ago
- How Contextual are Contextualized Word Representations?☆41Updated 5 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆103Updated 4 years ago
- Cleaned E2E NLG Challenge data + supporting scripts☆23Updated 4 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization☆71Updated 2 years ago
- Evaluating recurrent neural networks on predicting subject-verb agreement dependencies☆63Updated 2 years ago
- This is the Grammarly's Yahoo Answers Formality Corpus☆106Updated last week
- End-to-end shallow discourse parser☆21Updated 2 years ago
- This repository houses the IMPlicature and PRESupposition diagnostic dataset (IMPPRES), consisting of >25k semiautomatically generated se…☆19Updated 3 years ago
- ☆25Updated last year
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated 2 years ago
- ☆39Updated 4 years ago
- ☆163Updated 3 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated 2 years ago
- ACL 2020 Tutorial by Malihe Alikhani and Matthew Stone☆37Updated 5 years ago
- Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings☆95Updated 2 years ago
- ☆44Updated 4 years ago
- EMNLP DiscoEval paper☆43Updated 5 years ago
- A coreference evaluation package for the CoNLL and ARRAU datasets☆40Updated 4 years ago
- ☆43Updated 5 years ago
- ☆46Updated 5 years ago