successar / Eraser-Benchmark-Baseline-Models
Baseline for ERASER benchmark
☆17 · Updated 2 years ago
Alternatives and similar repositories for Eraser-Benchmark-Baseline-Models
Users who are interested in Eraser-Benchmark-Baseline-Models compare it to the repositories listed below
- Implementation for https://arxiv.org/abs/2005.00652 ☆28 · Updated 2 years ago
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning" ☆22 · Updated 3 years ago
- ☆27 · Updated last year
- ☆27 · Updated 2 years ago
- ☆58 · Updated 3 years ago
- Code for paper "Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation" EMNLP 2021 and "… ☆18 · Updated 3 years ago
- ☆24 · Updated last year
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/ ☆96 · Updated 2 years ago
- ☆20 · Updated 3 years ago
- SP-10K is a large-scale human-annotated selectional preference set. Five selectional preference relations are included. ☆11 · Updated 5 years ago
- Code for ModularQA ☆28 · Updated 3 years ago
- ☆13 · Updated 4 years ago
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers" ☆59 · Updated last year
- ☆45 · Updated 2 years ago
- ☆16 · Updated 3 years ago
- Repository for the code associated with the paper: Unsupervised Extractive Summarization using Mutual Information ☆25 · Updated 3 years ago
- ☆46 · Updated 5 years ago
- WinoWhy provides human-annotated reasons for answering WSC questions. ☆18 · Updated 5 years ago
- ☆15 · Updated 4 years ago
- ☆9 · Updated 5 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization ☆71 · Updated 2 years ago
- Code base for paper "Zero-Shot Cross-Lingual Transfer with Meta Learning" ☆34 · Updated 6 months ago
- ☆24 · Updated last year
- Code Repo for "Differentiable Open-Ended Commonsense Reasoning" (NAACL 2021) ☆32 · Updated last year
- ☆59 · Updated last year
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals. ☆18 · Updated 4 years ago
- Few-shot NLP benchmark for unified, rigorous eval ☆91 · Updated 2 years ago
- ☆43 · Updated 5 years ago
- Code for "Understanding Neural Abstractive Summarization Models via Uncertainty" (EMNLP20) ☆30 · Updated 4 years ago
- ☆17 · Updated 4 years ago