Repository for DEMETR: Diagnosing Evaluation Metrics for Translation
☆17Nov 29, 2022Updated 3 years ago
Alternatives and similar repositories for demetr
Users that are interested in demetr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20May 14, 2022Updated 3 years ago
- ☆29Dec 2, 2024Updated last year
- ☆35Dec 17, 2025Updated 4 months ago
- [EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling☆15Nov 20, 2025Updated 5 months ago
- ☆24Apr 2, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Sep 1, 2021Updated 4 years ago
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Domain-Specific Text Generation for Machine Translation (with LLMs) - scripts and config files for the paper☆18Aug 19, 2023Updated 2 years ago
- Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.☆23Jun 23, 2023Updated 2 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆129Apr 23, 2026Updated 2 weeks ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22May 24, 2023Updated 2 years ago
- ☆18Mar 20, 2019Updated 7 years ago
- Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine…☆12Aug 14, 2024Updated last year
- ☆21Dec 8, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Aug 10, 2024Updated last year
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆139Aug 2, 2023Updated 2 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 4 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 3 years ago
- To help search, filter, and download papers from 'acl anthology' (https://aclanthology.org/).☆18Sep 12, 2024Updated last year
- ☆12Jan 30, 2023Updated 3 years ago
- ☆47Mar 25, 2025Updated last year
- Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric …☆84Sep 21, 2023Updated 2 years ago
- human_detectors hosts the data released from the paper "People who frequently use ChatGPT for writing tasks are accurate and robust detec…☆50May 9, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- NLP course @ CS Faculty, HSE☆15Mar 4, 2020Updated 6 years ago
- ☆54Oct 24, 2024Updated last year
- A repository with the code related to experiments around context-aware machine translation☆51Sep 22, 2025Updated 7 months ago
- Library for experimenting with state-of-the-art evaluation metrics like UScore☆12May 27, 2023Updated 2 years ago
- ☆12Jul 15, 2024Updated last year
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Oct 3, 2025Updated 7 months ago
- ☆22Sep 19, 2023Updated 2 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Jun 3, 2024Updated last year
- A Neural Framework for MT Evaluation☆745Apr 21, 2026Updated 2 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆75Jul 2, 2021Updated 4 years ago
- Data and all☆14Sep 30, 2019Updated 6 years ago
- Code for Massive-scale Decoding for Text Generation using Lattices☆44Jul 29, 2022Updated 3 years ago
- Examples for the Spartan HPC cluster.☆10Sep 2, 2019Updated 6 years ago
- explainable-machine-translation-metrics☆12Jul 15, 2022Updated 3 years ago
- codes for "Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models"☆12Feb 10, 2025Updated last year
- Multilingual Quality Estimation and Automatic Post-editing Dataset☆43Mar 24, 2022Updated 4 years ago