ThomasScialom / QuestEval
☆98Updated last year
Alternatives and similar repositories for QuestEval
Users who are interested in QuestEval are comparing it to the libraries listed below.
- Codebase, data and models for the SummaC paper in TACL☆96Updated 4 months ago
- ☆48Updated 2 years ago
- FRANK: Factuality Evaluation Benchmark☆56Updated 2 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆144Updated 2 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆82Updated 4 years ago
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆48Updated 2 years ago
- ☆38Updated 2 years ago
- ☆33Updated 2 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆103Updated 4 years ago
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"☆126Updated last year
- ☆58Updated 3 years ago
- ☆71Updated 3 years ago
- ☆82Updated 2 years ago
- Code associated with the ACL 2021 DExperts paper☆115Updated 2 years ago
- ☆15Updated 3 years ago
- Automatic metrics for GEM tasks☆66Updated 2 years ago
- ☆27Updated 6 months ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization☆74Updated 3 years ago
- Few-shot NLP benchmark for unified, rigorous eval☆91Updated 2 years ago
- Code and dataset for the EMNLP 2021 Findings paper "Can NLI Models Verify QA Systems’ Predictions?"☆25Updated last year
- REALSumm: Re-evaluating Evaluation in Text Summarization☆71Updated 2 years ago
- ☆46Updated 2 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆119Updated 3 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆56Updated 2 years ago
- ☆92Updated 3 years ago
- ☆48Updated 2 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 3 years ago
- Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.0045…☆50Updated 2 years ago
- ☆59Updated 2 years ago
- Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆100Updated 2 years ago