Yale-LILY / SummEvalLinks

Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper

☆403

Alternatives and similar repositories for SummEval

Users that are interested in SummEval are comparing it to the libraries listed below

Sorting:

salesforce / factCC
Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper
☆305Updated 5 months ago
allenai / unifiedqa
UnifiedQA: Crossing Format Boundaries With a Single QA System
☆441Updated 3 years ago
AIPHES / emnlp19-moverscore
MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance
☆209Updated last year
google-deepmind / xquad
☆203Updated 3 years ago
neulab / BARTScore
BARTScore: Evaluating Generated Text as Text Generation
☆358Updated 3 years ago
facebookresearch / anli
Adversarial Natural Language Inference Benchmark
☆396Updated 3 years ago
shauryr / ACL-anthology-corpus
This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs
☆184Updated 2 years ago
allenai / PRIMER
The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization
☆157Updated 2 years ago
patil-suraj / exploring-T5
A repo to explore different NLP tasks which can be solved using T5
☆172Updated 4 years ago
castorini / pygaggle
a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini
☆351Updated last year
google-research-datasets / tydiqa
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …
☆316Updated 5 years ago
facebookresearch / PAQ
Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"
☆207Updated 4 years ago
allenai / scifact
Data and models for the SciFact verification task.
☆241Updated 2 years ago
neulab / ExplainaBoard
Interpretable Evaluation for AI Systems
☆364Updated 2 years ago
facebookresearch / ELI5
Scripts and links to recreate the ELI5 dataset.
☆327Updated 4 years ago
danieldeutsch / sacrerouge
SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.
☆145Updated 3 years ago
google-research / bleurt
BLEURT is a metric for Natural Language Generation based on transfer learning.
☆762Updated 2 years ago
google-research-datasets / boolean-questions
☆171Updated 6 years ago
rtmdrr / testSignificanceNLP
☆230Updated 4 years ago
bjascob / amrlib
A python library that makes AMR parsing, generation and visualization simple.
☆249Updated last year
allenai / naacl2021-longdoc-tutorial
☆345Updated 4 years ago
yg211 / bert_nli
A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)
☆136Updated last year
princeton-nlp / DensePhrases
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…
☆606Updated 3 years ago
timoschick / dino
This repository contains the code for "Generating Datasets with Pretrained Language Models".
☆189Updated 4 years ago
facebookresearch / MLQA
New dataset
☆308Updated 4 years ago
Yale-LILY / dart
Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"
☆155Updated 2 years ago
facebookresearch / SEAL
Search Engines with Autoregressive Language models
☆293Updated 2 years ago
google-research-datasets / xsum_hallucination_annotations
Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…
☆84Updated 4 years ago
feralvam / easse
Easier Automatic Sentence Simplification Evaluation
☆161Updated 2 years ago
ThomasScialom / QuestEval
☆100Updated last year