successar / Eraser-Benchmark-Baseline-Models
Baseline for ERASER benchmark
☆17 Updated 2 years ago
Alternatives and similar repositories for Eraser-Benchmark-Baseline-Models
Users who are interested in Eraser-Benchmark-Baseline-Models are comparing it to the repositories listed below.
- Implementation for https://arxiv.org/abs/2005.00652 ☆28 Updated 2 years ago
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning" ☆22 Updated 3 years ago
- ☆20 Updated 3 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/ ☆96 Updated 2 years ago
- Framework for testing models with AI2 leaderboards ☆21 Updated last year
- ☆46 Updated 5 years ago
- Few-shot NLP benchmark for unified, rigorous eval ☆91 Updated 2 years ago
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers" ☆59 Updated last year
- A unified approach to explain conditional text generation models. Pytorch. The code of paper "Local Explanation of Dialogue Response Gene… ☆17 Updated 3 years ago
- ☆58 Updated 3 years ago
- ☆27 Updated last year
- ☆15 Updated 4 years ago
- NILE : Natural Language Inference with Faithful Natural Language Explanations ☆30 Updated last year
- ☆46 Updated 2 years ago
- Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?" ☆25 Updated last year
- ☆27 Updated 2 years ago
- The official implementation of ACL 2020, "Logic-Guided Data Augmentation and Regularization for Consistent Question Answering". ☆71 Updated 10 months ago
- Repository for the code associated with the paper: Unsupervised Extractive Summarization using Mutual Information ☆25 Updated 3 years ago
- Code base for paper "Zero-Shot Cross-Lingual Transfer with Meta Learning" ☆34 Updated 6 months ago
- ☆24 Updated last year
- Code for paper "Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation" EMNLP 2021 and "… ☆18 Updated 3 years ago
- ☆24 Updated 3 years ago
- ☆17 Updated 5 years ago
- ☆49 Updated last year
- Code for Repl4NLP paper "A Cross-Task Analysis of Text Span Representations" ☆21 Updated 2 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi… ☆49 Updated 4 years ago
- ACL 2020 Tutorial by Malihe Alikhani and Matthew Stone ☆37 Updated 4 years ago
- SP-10K is a large-scale human-annotated selectional preference set. Five selectional preference relations are included. ☆11 Updated 5 years ago
- Code to create pre-training data for a span selection pre-training task inspired by reading comprehension and an effort to avoid encoding… ☆30 Updated 3 years ago
- ☆24 Updated 2 years ago