Yale-LILY / SummEvalLinks
Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper
☆403Updated last year
Alternatives and similar repositories for SummEval
Users that are interested in SummEval are comparing it to the libraries listed below
Sorting:
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆305Updated 5 months ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆441Updated 3 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 2 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆210Updated last year
- BARTScore: Evaluating Generated Text as Text Generation☆358Updated 3 years ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆184Updated last year
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆351Updated last year
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆145Updated 2 years ago
- ☆202Updated 3 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆137Updated last year
- ☆171Updated 6 years ago
- Adversarial Natural Language Inference Benchmark☆396Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆314Updated 5 years ago
- Scripts and links to recreate the ELI5 dataset.☆327Updated 4 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 4 years ago
- A repo to explore different NLP tasks which can be solved using T5☆172Updated 4 years ago
- Codebase, data and models for the SummaC paper in TACL☆102Updated 8 months ago
- Data and models for the SciFact verification task.☆240Updated last year
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆756Updated 2 years ago
- A python library that makes AMR parsing, generation and visualization simple.☆248Updated last year
- Interpretable Evaluation for AI Systems☆364Updated 2 years ago
- ☆345Updated 4 years ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆285Updated 2 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆207Updated 4 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆84Updated 4 years ago
- ☆230Updated 4 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆606Updated 3 years ago
- Large-scale multi-document summarization dataset and code☆286Updated 2 years ago
- Multi-hop dense retrieval for question answering☆217Updated 3 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆155Updated 2 years ago