google-research / fool-me-twice
Game code and data for Fool Me Twice: Entailment from Wikipedia Gamification https://arxiv.org/abs/2104.04725
☆18Updated last week
Alternatives and similar repositories for fool-me-twice:
Users that are interested in fool-me-twice are comparing it to the libraries listed below
- Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"☆25Updated last year
- ☆46Updated 5 years ago
- ☆77Updated 9 months ago
- Commonsense Ability Tests☆30Updated 2 years ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆43Updated 2 years ago
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Updated 2 years ago
- Code for WikiAsp: Multi-document aspect-based summarization.☆40Updated 4 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- Code for ModularQA☆28Updated 3 years ago
- ☆68Updated 3 years ago
- FRANK: Factuality Evaluation Benchmark☆52Updated 2 years ago
- Repository for the CODAH dataset☆22Updated 2 years ago
- ☆21Updated 3 years ago
- ☆13Updated 3 years ago
- The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".☆27Updated 3 years ago
- Code for paper "Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation" EMNLP 2021 and "…☆18Updated 3 years ago
- ☆33Updated last year
- FaVIQ: Fact Verification from Information-seeking Questions☆43Updated 2 years ago
- ☆20Updated 2 years ago
- ☆32Updated 3 years ago
- code for our paper "Understanding by Understanding Not: Modeling Negation in Language Models"☆16Updated 2 years ago
- [NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering☆36Updated 3 years ago
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources"☆20Updated 4 years ago
- ☆13Updated 4 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Updated 3 years ago
- A benchmark dataset for evaluating dialog system and natural language generation metrics.☆36Updated 2 years ago
- The official implementation of ACL 2020, "Logic-Guided Data Augmentation and Regularization for Consistent Question Answering".☆71Updated 6 months ago
- Contrastive Fact Verification☆71Updated 2 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16Updated 2 years ago
- Adaptive Passage Encoder for Open-domain Question Answering☆15Updated 3 years ago