marzenakrp / demetr
Repository for DEMETR: Diagnosing Evaluation Metrics for Translation
☆15Updated 2 years ago
Alternatives and similar repositories for demetr:
Users that are interested in demetr are comparing it to the libraries listed below
- ☆26Updated 4 months ago
- ☆15Updated 3 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"☆12Updated 3 years ago
- ☆19Updated 3 months ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆30Updated 2 years ago
- The geometry of multilingual language model representations (EMNLP 2022).☆20Updated 2 years ago
- Codebase, data and models for the Keep it Simple paper at ACL2021☆38Updated last year
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"☆14Updated 4 months ago
- FRANK: Factuality Evaluation Benchmark☆54Updated 2 years ago
- ☆45Updated 2 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆81Updated 4 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆102Updated 3 weeks ago
- ☆14Updated 3 years ago
- A repository with the code related to experiments around context-aware machine translation☆48Updated 2 years ago
- ☆84Updated 6 months ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆58Updated 4 years ago
- ☆58Updated 2 years ago
- ☆18Updated last year
- ☆25Updated 2 years ago
- ☆96Updated last year
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- ☆71Updated 3 years ago
- A library of translation-based text similarity measures☆25Updated last year
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆55Updated 2 years ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆22Updated last year
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆143Updated 2 years ago
- ☆23Updated last year
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…☆17Updated 2 years ago