Yale-LILY / SummEvalLinks
Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper
☆409Updated last year
Alternatives and similar repositories for SummEval
Users that are interested in SummEval are comparing it to the libraries listed below
Sorting:
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆305Updated 7 months ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆442Updated 3 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 3 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆366Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆318Updated 5 years ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆187Updated 2 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆210Updated 2 years ago
- Adversarial Natural Language Inference Benchmark☆396Updated 3 years ago
- ☆206Updated 4 years ago
- Interpretable Evaluation for AI Systems☆365Updated 2 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆138Updated last year
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆286Updated 2 years ago
- Scripts and links to recreate the ELI5 dataset.☆326Updated 4 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆148Updated 3 years ago
- ☆345Updated 4 years ago
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆352Updated 2 years ago
- Codebase, data and models for the SummaC paper in TACL☆106Updated 10 months ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆207Updated 4 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆775Updated 2 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆189Updated 4 years ago
- ☆231Updated 4 years ago
- A python library that makes AMR parsing, generation and visualization simple.☆254Updated last year
- Few-shot Learning of GPT-3☆356Updated 2 years ago
- Easier Automatic Sentence Simplification Evaluation☆165Updated 2 years ago
- DialogSum: A Real-life Scenario Dialogue Summarization Dataset - Findings of ACL 2021☆184Updated last year
- Multi-hop dense retrieval for question answering☆216Updated 4 years ago
- ☆176Updated 6 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension and question answerin…☆223Updated 2 years ago
- Data and models for the SciFact verification task.☆245Updated 2 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆84Updated 5 years ago