allenai / winograndeLinks
WinoGrande: An Adversarial Winograd Schema Challenge at Scale
☆101Updated 5 years ago
Alternatives and similar repositories for winogrande
Users that are interested in winogrande are comparing it to the libraries listed below
Sorting:
- ☆97Updated 3 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆105Updated 4 years ago
- Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)☆113Updated 3 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆84Updated 5 years ago
- ☆40Updated 3 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆157Updated 3 years ago
- Neural models of common sense. 🤖☆98Updated 2 years ago
- ☆85Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 3 years ago
- ☆47Updated 2 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆57Updated 3 years ago
- ☆59Updated 2 years ago
- Automatic metrics for GEM tasks☆67Updated 3 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆209Updated 4 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆120Updated 3 years ago
- ☆58Updated 3 years ago
- ☆83Updated 2 years ago
- Language model Prompt And Query Archive☆160Updated 4 years ago
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".☆83Updated 2 weeks ago
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆98Updated 2 years ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆183Updated 3 years ago
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)☆80Updated 2 years ago
- Code for Editing Factual Knowledge in Language Models☆142Updated 4 years ago
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"☆20Updated 4 years ago
- Few-shot NLP benchmark for unified, rigorous eval☆93Updated 3 years ago
- ☆175Updated 6 years ago
- ☆30Updated 4 years ago
- ☆329Updated 4 years ago
- [EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data☆41Updated 3 years ago
- This repository contains the code for "How many data points is a prompt worth?"☆48Updated 4 years ago