artidoro / frank
FRANK: Factuality Evaluation Benchmark
☆59 Updated 2 years ago
Alternatives and similar repositories for frank
Users that are interested in frank are comparing it to the libraries listed below
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020) ☆49 Updated 2 years ago
- ☆14 Updated 2 years ago
- ☆46 Updated 2 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes. ☆34 Updated 2 years ago
- ☆39 Updated 2 years ago
- ☆27 Updated 2 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h… ☆84 Updated 4 years ago
- Question Answering and Generation for Summarization ☆71 Updated 2 years ago
- ☆24 Updated 3 years ago
- Codebase, data and models for the SummaC paper in TACL ☆100 Updated 7 months ago
- Repository for ACL'22 paper: Dynamic Latent Extraction for Abstractive Long-Input Summarization ☆55 Updated 2 years ago
- ☆58 Updated 3 years ago
- ☆50 Updated 2 years ago
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization" ☆133 Updated 2 years ago
- ☆28 Updated 2 years ago
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023) ☆27 Updated last year
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)" ☆63 Updated 2 years ago
- ☆50 Updated 2 years ago
- ☆18 Updated 4 years ago
- ☆43 Updated 2 years ago
- ☆62 Updated 2 years ago
- ☆17 Updated 2 years ago
- Code and resources for papers "Generation-Augmented Retrieval for Open-Domain Question Answering" and "Reader-Guided Passage Reranking fo… ☆74 Updated 3 years ago
- The code repository for NAACL 2021 paper "AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization". ☆35 Updated 4 years ago
- ☆44 Updated 4 years ago
- Codes for our paper "CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation" (ACL 2022) ☆33 Updated 3 years ago
- A benchmark dataset for evaluating dialog system and natural language generation metrics. ☆39 Updated 3 years ago
- 🐥 Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents" ☆63 Updated 2 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset of crowd-annotated conversational claims, paired with pieces of evidence fr… ☆42 Updated 2 years ago
- Code for paper "Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation" EMNLP 2021 and "… ☆18 Updated 3 years ago