aetting / lm-diagnostics
Diagnostic tests for linguistic capacities in language models
☆65 · Updated 3 years ago
Alternatives and similar repositories for lm-diagnostics
Users who are interested in lm-diagnostics are comparing it to the libraries listed below.
- ☆231 · Updated 4 years ago
- Evaluating Text Representations on Lexical Composition ☆24 · Updated 6 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization. ☆148 · Updated 3 years ago
- The Benchmark of Linguistic Minimal Pairs ☆161 · Updated 3 years ago
- Heuristic Analysis for NLI Systems ☆128 · Updated 5 years ago
- How Contextual are Contextualized Word Representations? ☆43 · Updated 5 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h… ☆84 · Updated 5 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/ ☆101 · Updated 3 years ago
- Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models" ☆200 · Updated 5 years ago
- ☆34 · Updated last week
- A Python package of common operations for AMRs ☆29 · Updated 3 years ago
- EMNLP DiscoEval paper ☆43 · Updated 6 years ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs ☆188 · Updated 2 years ago
- ☆39 · Updated 4 years ago
- Lexical Substitution Framework ☆46 · Updated 2 years ago
- Repository for DISRPT2023 shared task ☆17 · Updated last year
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann… ☆30 · Updated 5 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization ☆73 · Updated 4 months ago
- ☆59 · Updated 2 years ago
- Perspectrum: a dataset of claims, perspectives and evidence documents ☆34 · Updated 6 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit ☆59 · Updated 5 months ago
- ☆166 · Updated 3 years ago
- End-to-end shallow discourse parser ☆23 · Updated 2 years ago
- Conversion scripts for coreference ☆28 · Updated last year
- Multi-Annotator Competence Estimation tool ☆134 · Updated 2 weeks ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ☆105 · Updated 5 years ago
- Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings ☆96 · Updated 2 years ago
- ☆30 · Updated 2 years ago
- ☆15 · Updated 7 years ago
- Various utility scripts useful for natural language processing, machine translation, etc. ☆50 · Updated 3 years ago