aetting / lm-diagnostics
Diagnostic tests for linguistic capacities in language models
☆66 · Updated 2 years ago
Alternatives and similar repositories for lm-diagnostics:
Users who are interested in lm-diagnostics are comparing it to the libraries listed below.
- ☆31 · Updated 3 months ago
- ☆39 · Updated 3 years ago
- End-to-end shallow discourse parser ☆20 · Updated last year
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them ☆37 · Updated 2 years ago
- ☆29 · Updated last year
- A python package of common operations for AMRs ☆29 · Updated 2 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit ☆57 · Updated last year
- This repository houses the IMPlicature and PRESupposition diagnostic dataset (IMPPRES), consisting of >25k semiautomatically generated se… ☆19 · Updated 3 years ago
- ☆229 · Updated 4 years ago
- Evaluating recurrent neural networks on predicting subject-verb agreement dependencies ☆63 · Updated 2 years ago
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann… ☆29 · Updated 5 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization ☆71 · Updated 2 years ago
- Organized inventory of research using the Abstract Meaning Representation ☆37 · Updated this week
- A program to choose transfer languages for cross-lingual learning ☆72 · Updated last year
- Assessing syntactic abilities of BERT ☆148 · Updated 5 years ago
- A coreference evaluation package for the CoNLL and ARRAU datasets ☆40 · Updated 4 years ago
- Code for replicating the work in "Targeted Syntactic Evaluation of Language Models." EMNLP 2018. ☆43 · Updated 5 years ago
- How Contextual are Contextualized Word Representations? ☆41 · Updated 5 years ago
- Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models" ☆28 · Updated 3 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h… ☆82 · Updated 4 years ago
- Perspectrum: a dataset of claims, perspectives and evidence documents ☆33 · Updated 5 years ago
- EMNLP DiscoEval paper ☆43 · Updated 5 years ago
- ☆59 · Updated last year
- Various utility scripts useful for natural language processing, machine translation, etc. ☆49 · Updated 2 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework ☆52 · Updated 5 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ☆103 · Updated 4 years ago
- Hyperparameter Search for AllenNLP ☆139 · Updated last month
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/ ☆96 · Updated 2 years ago
- Multi-Annotator Competence Estimation tool ☆63 · Updated 5 years ago
- Code and data for paper Colorless Green Recurrent Networks Dream Hierarchically ☆92 · Updated 3 years ago