marzenakrp / demetrLinks
Repository for DEMETR: Diagnosing Evaluation Metrics for Translation
☆15Updated 2 years ago
Alternatives and similar repositories for demetr
Users that are interested in demetr are comparing it to the libraries listed below
Sorting:
- ☆28Updated 9 months ago
- Find informative examples to efficiently (human)-evaluate NLG models.☆16Updated last month
- ☆15Updated 4 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆84Updated 4 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆115Updated 6 months ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"☆12Updated 4 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆145Updated 2 years ago
- ☆100Updated last year
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆22Updated 2 years ago
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"☆16Updated 10 months ago
- ☆24Updated last year
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆56Updated 3 years ago
- ☆58Updated 3 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization☆71Updated 2 years ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆58Updated 5 years ago
- FRANK: Factuality Evaluation Benchmark☆59Updated 2 years ago
- ☆54Updated 3 years ago
- ☆71Updated 3 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆102Updated last year
- ☆46Updated 2 years ago
- A framework for evaluating Machine Translation models.☆10Updated 3 months ago
- ☆25Updated 2 years ago
- ☆30Updated 2 years ago
- A repository with the code related to experiments around context-aware machine translation☆51Updated 3 years ago
- Automatic metrics for GEM tasks☆67Updated 2 years ago
- Detect hallucinated tokens for conditional sequence generation.☆64Updated 3 years ago
- code associated with ACL 2021 DExperts paper☆116Updated 2 years ago
- ☆20Updated last year
- ☆100Updated 3 years ago
- The geometry of multilingual language model representations (EMNLP 2022).☆21Updated 2 years ago