artidoro / frank
FRANK: Factuality Evaluation Benchmark
☆54Updated 2 years ago
Alternatives and similar repositories for frank:
Users interested in frank are comparing it to the repositories listed below:
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆48Updated last year
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆81Updated 4 years ago
- ☆45Updated last year
- ☆58Updated 2 years ago
- ☆47Updated 2 years ago
- ☆38Updated last year
- ☆14Updated last year
- Question Answering and Generation for Summarization☆68Updated 2 years ago
- ☆33Updated last year
- ☆24Updated 2 years ago
- Repository for ACL'22 paper: Dynamic Latent Extraction for Abstractive Long-Input Summarization☆55Updated last year
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆23Updated 11 months ago
- ☆27Updated 2 years ago
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"☆117Updated last year
- 🐥 Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents"☆62Updated last year
- Codebase, data and models for the SummaC paper in TACL☆89Updated last month
- We construct and introduce DIALFACT, a testing benchmark dataset of crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆38Updated 2 years ago
- ☆25Updated 2 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Updated last year
- ☆44Updated last year
- ☆29Updated 3 years ago
- ☆82Updated last year
- ☆43Updated 3 years ago
- ☆15Updated 3 years ago
- ☆77Updated 10 months ago
- Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.0045…☆50Updated 2 years ago
- Code and dataset for the EMNLP 2021 Findings paper "Can NLI Models Verify QA Systems’ Predictions?"☆25Updated last year
- Code for paper "Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation" EMNLP 2021 and "…☆18Updated 3 years ago
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021).☆28Updated 3 years ago