TalSchuster / VitaminC
Contrastive Fact Verification
☆71Updated 2 years ago
Alternatives and similar repositories for VitaminC:
Users that are interested in VitaminC are comparing it to the libraries listed below
- Symmetric evaluation set based on the FEVER (fact verification) dataset☆52Updated 3 years ago
- Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation"☆63Updated 3 years ago
- ☆58Updated 2 years ago
- ☆15Updated 3 years ago
- FRANK: Factuality Evaluation Benchmark☆52Updated 2 years ago
- ☆24Updated last year
- ☆33Updated last year
- ☆47Updated 2 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization☆71Updated 2 years ago
- ☆45Updated last year
- Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"☆25Updated last year
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆48Updated last year
- ☆70Updated 10 months ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆43Updated 2 years ago
- ☆34Updated 4 years ago
- Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.0045…☆50Updated 2 years ago
- ☆77Updated 9 months ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 2 years ago
- FaVIQ: Fact Verification from Information-seeking Questions☆43Updated 2 years ago
- ☆61Updated 2 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆38Updated 2 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Updated 3 years ago
- ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences☆28Updated last year
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- Few-shot NLP benchmark for unified, rigorous eval☆91Updated 2 years ago
- Source code of the paper "Do Syntax Trees Help Pre-trained Transformers Extract Information?" (EACL 2021)☆75Updated 3 years ago
- This repository contains the code for "How many data points is a prompt worth?"☆48Updated 3 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆81Updated 4 years ago
- ☆31Updated last year
- Code for our EACL-2021 paper "Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs".☆38Updated 7 months ago