allenai / winograndeLinks

WinoGrande: An Adversarial Winograd Schema Challenge at Scale

☆98

Alternatives and similar repositories for winogrande

Users that are interested in winogrande are comparing it to the libraries listed below

Sorting:

shmsw25 / AmbigQA
An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"
☆119Updated 3 years ago
allenai / Break
☆84Updated 2 years ago
jzbjyb / LPAQA
Language model Prompt And Query Archive
☆158Updated 4 years ago
google-research / dialog-inpainting
☆97Updated 2 years ago
cambridgeltl / xcopa
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
☆103Updated 4 years ago
GEM-benchmark / GEM-metrics
Automatic metrics for GEM tasks
☆66Updated 2 years ago
jayded / eraserbenchmark
A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/
☆96Updated 2 years ago
tommccoy1 / hans
Heuristic Analysis for NLI Systems
☆126Updated 4 years ago
Yale-LILY / dart
Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"
☆155Updated 2 years ago
facebookresearch / asset
A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations
☆56Updated 2 years ago
google-research-datasets / xsum_hallucination_annotations
Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…
☆82Updated 4 years ago
allenai / contrast-sets
☆59Updated 2 years ago
ElementalCognition / glucose
GLUCOSE: GeneraLized and COntextualized Story Explanations https://arxiv.org/abs/2009.07758
☆92Updated 4 years ago
peterwestuw / surface-form-competition
☆58Updated 3 years ago
peterwestai2 / symbolic-knowledge-distillation
☆141Updated 2 years ago
jacobandreas / geca
☆42Updated 4 years ago
allenai / allennlp-semparse
A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP
☆108Updated 3 years ago
awebson / prompt_semantics
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
☆85Updated 3 years ago
allenai / qasc
Repository for the Question Answering via Sentence Composition (QASC) dataset
☆56Updated last year
shmsw25 / bart-closed-book-qa
A BART version of an open-domain QA model in a closed-book setup
☆119Updated 4 years ago
tomerwolgithub / Break
☆46Updated 2 years ago
berlino / tensor2struct-public
Semantic parsers based on encoder-decoder framework
☆91Updated 2 years ago
INK-USC / CrossFit
Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)
☆111Updated 3 years ago
McGill-NLP / FaithDial
☆50Updated 2 years ago
TevenLeScao / pet
This repository contains the code for "How many data points is a prompt worth?"
☆48Updated 4 years ago
facebookresearch / PAQ
Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"
☆204Updated 3 years ago
najoungkim / COGS
☆61Updated 2 years ago
Shikib / usr
Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.0045…
☆50Updated 2 years ago
neulab / REALSumm
REALSumm: Re-evaluating Evaluation in Text Summarization
☆71Updated 2 years ago
vipulraheja / iterater
Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)
☆78Updated last year