☆104Mar 4, 2024Updated 2 years ago
Alternatives and similar repositories for QuestEval
Users that are interested in QuestEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆48Jun 12, 2023Updated 2 years ago
- ☆57Mar 27, 2023Updated 3 years ago
- Codebase, data and models for the SummaC paper in TACL☆109Jan 30, 2025Updated last year
- FactSumm: Factual Consistency Scorer for Abstractive Summarization☆113Jan 1, 2024Updated 2 years ago
- ☆15Aug 3, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆28Mar 26, 2024Updated 2 years ago
- Code for ViLBERTScore in EMNLP Eval4NLP☆18Oct 27, 2022Updated 3 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆151Oct 22, 2022Updated 3 years ago
- The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch.☆17Nov 11, 2021Updated 4 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆369Jun 27, 2022Updated 3 years ago
- Question Answering and Generation for Summarization☆71Nov 27, 2022Updated 3 years ago
- FRANK: Factuality Evaluation Benchmark☆59Dec 13, 2022Updated 3 years ago
- Code for EMNLP 2021 paper "CLIFF: Contrastive Learning for Improving Faithfulness and Factuality in Abstractive Summarization"☆47Jan 17, 2022Updated 4 years ago
- ☆10Jul 18, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- KPQA is an evaluation metric for generative question answering. (NAACL-21)☆33Aug 3, 2021Updated 4 years ago
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆308May 1, 2025Updated 11 months ago
- ☆27Nov 29, 2022Updated 3 years ago
- Library for experimenting with state-of-the-art evaluation metrics like UScore☆12May 27, 2023Updated 2 years ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆415Jun 23, 2024Updated last year
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆14Jul 22, 2025Updated 8 months ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆84Nov 26, 2020Updated 5 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆150May 1, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for ACL 2022 Paper: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons☆14Dec 22, 2022Updated 3 years ago
- ☆18Apr 11, 2021Updated 5 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆40Sep 15, 2022Updated 3 years ago
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆97Mar 20, 2023Updated 3 years ago
- Tools to estimate the correlation of different text-based evaluation measures for Automatic Image Description☆10Feb 2, 2017Updated 9 years ago
- Code and datasets for the paper "Can Pre-trained Language Models Interpret Similes as Smart as Human?" (ACL 2022)☆14Jan 4, 2023Updated 3 years ago
- Human-free quality estimation of document summaries☆97Dec 1, 2025Updated 4 months ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆213Nov 20, 2023Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Aug 10, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆13Nov 14, 2022Updated 3 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- Multilingual abstractive summarization dataset extracted from WikiHow.☆99Mar 14, 2025Updated last year
- ☆39Jan 9, 2023Updated 3 years ago
- Implementation of the paper "FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations (NAACL 2022)"☆50Jul 26, 2023Updated 2 years ago
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"☆146Aug 29, 2023Updated 2 years ago
- ☆30Sep 5, 2021Updated 4 years ago