Yale-LILY / SummEvalLinks
Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper
☆401Updated last year
Alternatives and similar repositories for SummEval
Users that are interested in SummEval are comparing it to the libraries listed below
Sorting:
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆303Updated 3 months ago
- BARTScore: Evaluating Generated Text as Text Generation☆357Updated 3 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆441Updated 3 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆209Updated last year
- ☆201Updated 3 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 2 years ago
- Adversarial Natural Language Inference Benchmark☆397Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆310Updated 5 years ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆182Updated last year
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆204Updated 3 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆132Updated last year
- Codebase, data and models for the SummaC paper in TACL☆98Updated 6 months ago
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆350Updated last year
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 4 years ago
- Interpretable Evaluation for AI Systems☆367Updated 2 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆751Updated 2 years ago
- Multi-hop dense retrieval for question answering☆217Updated 3 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension and question answerin…☆221Updated 2 years ago
- A repo to explore different NLP tasks which can be solved using T5☆172Updated 4 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆144Updated 2 years ago
- ☆100Updated last year
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆84Updated 4 years ago
- Data and models for the SciFact verification task.☆238Updated last year
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆605Updated 3 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆380Updated 2 years ago
- ☆345Updated 4 years ago
- Scripts and links to recreate the ELI5 dataset.☆326Updated 3 years ago
- Few-shot Learning of GPT-3☆353Updated last year
- ☆169Updated 6 years ago
- New dataset☆306Updated 3 years ago