kstats / CIMALinks
☆22Updated 4 years ago
Alternatives and similar repositories for CIMA
Users that are interested in CIMA are comparing it to the libraries listed below
Sorting:
- ☆99Updated last year
- Codebase, data and models for the SummaC paper in TACL☆98Updated 6 months ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆144Updated 2 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆84Updated 4 years ago
- GEMBA — GPT Estimation Metric Based Assessment☆121Updated last year
- a tool for calcualting character n-gram F score☆73Updated 2 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆111Updated 4 months ago
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆303Updated 3 months ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆104Updated 4 years ago
- ☆97Updated 3 years ago
- ☆89Updated 10 months ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆56Updated 2 years ago
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).☆157Updated 3 months ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆182Updated last year
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 2 years ago
- ☆71Updated 3 years ago
- Multilingual Dialogue Datasets☆19Updated 2 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆208Updated last year
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"☆131Updated last year
- ☆50Updated 2 years ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆15Updated 2 weeks ago
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".☆81Updated 3 weeks ago
- A dataset focused on summarization of dialogs, which represents the rich domain of Twitter customer care conversations☆32Updated last year
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"☆16Updated 9 months ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization☆76Updated 4 years ago
- FRANK: Factuality Evaluation Benchmark☆57Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆78Updated 3 years ago
- ☆84Updated 2 years ago
- Resources for the shared task on conversational question answering SCAI-QReCC 2021☆29Updated 3 years ago
- The Stanford Word Substitution (Swords) Benchmark☆32Updated 3 years ago