kstats / CIMA
☆21Updated 3 years ago
Alternatives and similar repositories for CIMA:
Users that are interested in CIMA are comparing it to the libraries listed below
- ☆97Updated 2 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆55Updated 2 years ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆14Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆74Updated 3 years ago
- ☆16Updated 3 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆101Updated 2 weeks ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆43Updated 7 months ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆30Updated 2 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆81Updated 4 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆71Updated last year
- ☆9Updated 2 years ago
- ☆15Updated 3 years ago
- ☆97Updated 2 years ago
- ☆21Updated 10 months ago
- ☆48Updated 2 years ago
- FRANK: Factuality Evaluation Benchmark☆54Updated 2 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆142Updated 2 years ago
- ☆44Updated 3 years ago
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"☆14Updated 4 months ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆55Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 3 weeks ago
- Dense hybrid representations for text retrieval☆62Updated last year
- ☆82Updated 2 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆101Updated 4 years ago
- Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)☆22Updated 4 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 2 years ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆21Updated 2 weeks ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization☆71Updated 3 years ago
- [EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data☆36Updated 2 years ago