(NAACL 2024) Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations
☆15Apr 14, 2025Updated 10 months ago
Alternatives and similar repositories for mt_feedback
Users that are interested in mt_feedback are comparing it to the libraries listed below
Sorting:
- Multilingual Quality Estimation and Automatic Post-editing Dataset☆42Mar 24, 2022Updated 3 years ago
- Find informative examples to efficiently (human)-evaluate NLG models.☆18Updated this week
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆126Oct 13, 2025Updated 4 months ago
- ☆30Nov 14, 2025Updated 3 months ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"☆12Jun 7, 2021Updated 4 years ago
- Making a bridge between NLP models and Brain data☆19Jun 3, 2020Updated 5 years ago
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Jan 19, 2024Updated 2 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22May 24, 2023Updated 2 years ago
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆24Aug 15, 2025Updated 6 months ago
- Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.☆23Jun 23, 2023Updated 2 years ago
- ☆24Apr 2, 2024Updated last year
- ☆98Sep 25, 2025Updated 5 months ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆22Apr 13, 2023Updated 2 years ago
- ☆26Jan 9, 2023Updated 3 years ago
- Repository for "Uncertainty-Aware Machine Translation Evaluation", accepted to Findings of EMNLP 2021.☆34Sep 22, 2021Updated 4 years ago
- Tool for comparison and evaluation of machine translation.☆56May 17, 2022Updated 3 years ago
- OpusFilter - Parallel corpus processing toolkit☆115Feb 11, 2026Updated 2 weeks ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- Frontend of LINDAT translation service☆24Feb 21, 2026Updated last week
- Appraise code used as part of WMT21 human evaluation campaign☆30Dec 15, 2025Updated 2 months ago
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- Seed Machine Translation Data☆33Nov 12, 2024Updated last year
- ☆81Jan 30, 2026Updated last month
- Lightweight self-hosted span annotation tool☆39Jan 20, 2026Updated last month
- NTREX -- News Test References for MT Evaluation☆88Jun 5, 2024Updated last year
- Markdown-compatible AI-Powered Terminal Notepad☆14Apr 24, 2025Updated 10 months ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models☆12May 30, 2025Updated 9 months ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆81Aug 31, 2021Updated 4 years ago
- Featurize words into orthographic and phonological vectors.☆41May 20, 2023Updated 2 years ago
- A High-Quality Multilingual Dataset for Structured Documentation Translation☆37May 1, 2025Updated 10 months ago
- Framework for neural-based Quality Estimation☆41Sep 23, 2020Updated 5 years ago
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 9 months ago
- COMET for African languages☆10Jan 24, 2025Updated last year
- ☆14Apr 29, 2025Updated 10 months ago
- Convert ABN Amro CSV bank statements to QIF☆11Jun 8, 2017Updated 8 years ago
- ☆10May 28, 2024Updated last year
- Open-Source Machine Translation Quality Estimation in PyTorch☆232Jun 23, 2022Updated 3 years ago
- ☆32Sep 12, 2022Updated 3 years ago