google-research-datasets / xsum_hallucination_annotations
Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (https://www.aclweb.org/anthology/2020.acl-main.173.pdf).
☆81Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for xsum_hallucination_annotations
- ☆57Updated last year
- FRANK: Factuality Evaluation Benchmark☆52Updated last year
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆47Updated last year
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆138Updated 2 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization☆71Updated last year
- ☆57Updated 2 years ago
- ☆41Updated 3 years ago
- Few-shot NLP benchmark for unified, rigorous eval☆91Updated 2 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆148Updated last year
- ☆41Updated last year
- Codebase, data and models for the SummaC paper in TACL☆85Updated 10 months ago
- ☆27Updated last year
- ☆45Updated last year
- ☆46Updated 4 years ago
- ☆48Updated last year
- ☆42Updated last year
- ☆77Updated 6 months ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆59Updated 4 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"☆66Updated 3 years ago
- Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.0045…☆50Updated last year
- ☆70Updated 3 years ago
- ☆24Updated 2 years ago
- ☆37Updated 3 years ago
- ☆90Updated 8 months ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/☆97Updated last year
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers"☆55Updated last year
- Detect hallucinated tokens for conditional sequence generation.☆63Updated 2 years ago
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"☆110Updated last year
- Question Answering and Generation for Summarization☆68Updated last year
- ☆42Updated 3 years ago