☆103Mar 4, 2024Updated 2 years ago
Alternatives and similar repositories for QuestEval
Users that are interested in QuestEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆48Jun 12, 2023Updated 2 years ago
- ☆55Mar 27, 2023Updated 3 years ago
- Codebase, data and models for the SummaC paper in TACL☆108Jan 30, 2025Updated last year
- FactSumm: Factual Consistency Scorer for Abstractive Summarization☆113Jan 1, 2024Updated 2 years ago
- ☆15Aug 3, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆28Mar 26, 2024Updated 2 years ago
- Code for ViLBERTScore in EMNLP Eval4NLP☆18Oct 27, 2022Updated 3 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆150Oct 22, 2022Updated 3 years ago
- The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch.☆17Nov 11, 2021Updated 4 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆367Jun 27, 2022Updated 3 years ago
- Question Answering and Generation for Summarization☆71Nov 27, 2022Updated 3 years ago
- FRANK: Factuality Evaluation Benchmark☆59Dec 13, 2022Updated 3 years ago
- Code for EMNLP 2021 paper "CLIFF: Contrastive Learning for Improving Faithfulness and Factuality in Abstractive Summarization"☆47Jan 17, 2022Updated 4 years ago
- ☆10Jul 18, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- KPQA is an evaluation metric for generative question answering. (NAACL-21)☆33Aug 3, 2021Updated 4 years ago
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆308May 1, 2025Updated 10 months ago
- ☆27Nov 29, 2022Updated 3 years ago
- Library for experimenting with state-of-the-art evaluation metrics like UScore☆12May 27, 2023Updated 2 years ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆413Jun 23, 2024Updated last year
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆14Jul 22, 2025Updated 8 months ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆84Nov 26, 2020Updated 5 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆149May 1, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Code for ACL 2022 Paper: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons☆14Dec 22, 2022Updated 3 years ago
- ☆18Apr 11, 2021Updated 4 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆40Sep 15, 2022Updated 3 years ago
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆97Mar 20, 2023Updated 3 years ago
- Tools to estimate the correlation of different text-based evaluation measures for Automatic Image Description☆10Feb 2, 2017Updated 9 years ago
- Code and datasets for the paper "Can Pre-trained Language Models Interpret Similes as Smart as Human?" (ACL 2022)☆14Jan 4, 2023Updated 3 years ago
- Human-free quality estimation of document summaries☆97Dec 1, 2025Updated 3 months ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆213Nov 20, 2023Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Aug 10, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Creative Instructions Project☆11Sep 4, 2023Updated 2 years ago
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆13Nov 14, 2022Updated 3 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- Multilingual abstractive summarization dataset extracted from WikiHow.☆99Mar 14, 2025Updated last year
- ☆46May 26, 2023Updated 2 years ago
- ☆39Jan 9, 2023Updated 3 years ago
- Implementation of the paper "FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations (NAACL 2022)"☆50Jul 26, 2023Updated 2 years ago