marzenakrp / demetr
Repository for DEMETR: Diagnosing Evaluation Metrics for Translation
☆15 Updated 2 years ago
Alternatives and similar repositories for demetr
Users who are interested in demetr are comparing it to the repositories listed below
- ☆27 Updated 5 months ago
- The geometry of multilingual language model representations (EMNLP 2022). ☆20 Updated 2 years ago
- FRANK: Factuality Evaluation Benchmark ☆55 Updated 2 years ago
- ☆20 Updated 5 months ago
- A repository with the code related to experiments around context-aware machine translation ☆50 Updated 2 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization. ☆143 Updated 2 years ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer" ☆12 Updated 3 years ago
- ☆71 Updated 3 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks. ☆108 Updated 2 months ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models". ☆58 Updated 4 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations ☆55 Updated 2 years ago
- ☆15 Updated 3 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation" ☆30 Updated 3 years ago
- ☆58 Updated 3 years ago
- Find informative examples to efficiently (human)-evaluate NLG models. ☆10 Updated this week
- ☆48 Updated 2 years ago
- ☆24 Updated 11 months ago
- Codebase, data and models for the Keep it Simple paper at ACL2021 ☆39 Updated last year
- Tool to perform paired evaluation of automatic systems ☆12 Updated 3 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆40 Updated last year
- ☆62 Updated 2 years ago
- Official code for LEWIS, from: "LEWIS: Levenshtein Editing for Unsupervised Text Style Transfer", ACL-IJCNLP 2021 Findings by Machel Rei… ☆31 Updated 2 years ago
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020) ☆48 Updated last year
- ☆100 Updated 2 years ago
- ☆86 Updated 7 months ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h… ☆82 Updated 4 years ago
- ☆39 Updated 3 years ago
- A library of translation-based text similarity measures ☆25 Updated last year
- ☆25 Updated 2 years ago
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP". ☆88 Updated 3 years ago