allenai / winogrande
WinoGrande: An Adversarial Winograd Schema Challenge at Scale
☆95 Updated 5 years ago
Alternatives and similar repositories for winogrande
Users who are interested in winogrande compare it to the repositories listed below.
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations ☆55 Updated 2 years ago
- A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP ☆108 Updated 3 years ago
- Heuristic Analysis for NLI Systems ☆125 Updated 4 years ago
- This repository houses the IMPlicature and PRESupposition diagnostic dataset (IMPPRES), consisting of >25k semiautomatically generated se… ☆19 Updated 3 years ago
- The Benchmark of Linguistic Minimal Pairs ☆150 Updated 2 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h… ☆82 Updated 4 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ☆103 Updated 4 years ago
- Language model Prompt And Query Archive ☆158 Updated 4 years ago
- Code and Data for Evaluation WG ☆41 Updated 3 years ago
- Diagnostic tests for linguistic capacities in language models ☆66 Updated 3 years ago
- Evaluating recurrent neural networks on predicting subject-verb agreement dependencies ☆63 Updated 2 years ago
- ☆46 Updated 2 years ago
- Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models" ☆28 Updated 3 years ago
- ☆97 Updated 2 years ago
- ☆84 Updated 2 years ago
- Automatic metrics for GEM tasks ☆66 Updated 2 years ago
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022) ☆78 Updated last year
- Semantic parsers based on encoder-decoder framework ☆92 Updated 2 years ago
- ☆59 Updated last year
- Repository for the Question Answering via Sentence Composition (QASC) dataset ☆56 Updated last year
- Hyperparameter Search for AllenNLP ☆139 Updated 3 months ago
- ☆60 Updated 2 years ago
- Codebase for the Summary Loop paper at ACL2020 ☆44 Updated last year
- ☆42 Updated 4 years ago
- Code to reproduce the experiments from the paper. ☆101 Updated last year
- Few-shot NLP benchmark for unified, rigorous eval ☆91 Updated 2 years ago
- A BART version of an open-domain QA model in a closed-book setup ☆119 Updated 4 years ago
- PyTorch original implementation of "Unsupervised Question Decomposition for Question Answering" ☆121 Updated last year
- Perspectrum: a dataset of claims, perspectives and evidence documents ☆33 Updated 5 years ago
- ☆82 Updated 2 years ago