(NAACL 2024) Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations
☆15Apr 14, 2025Updated 11 months ago
Alternatives and similar repositories for mt_feedback
Users that are interested in mt_feedback are comparing it to the libraries listed below
Sorting:
- Find informative examples to efficiently (human)-evaluate NLG models.☆18Feb 27, 2026Updated 3 weeks ago
- Multilingual Quality Estimation and Automatic Post-editing Dataset☆42Mar 24, 2022Updated 3 years ago
- ☆30Nov 14, 2025Updated 4 months ago
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- ☆11Sep 19, 2025Updated 6 months ago
- ☆29Dec 2, 2024Updated last year
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆127Oct 13, 2025Updated 5 months ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"☆12Jun 7, 2021Updated 4 years ago
- Pipelined quality estimation.☆51Aug 13, 2019Updated 6 years ago
- Implementation of NAACL 2024 paper Unveiling the Generalization Power of Fine-Tuned Large Language Models☆11Mar 14, 2024Updated 2 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22May 24, 2023Updated 2 years ago
- ☆24Apr 2, 2024Updated last year
- Frontend of LINDAT translation service☆24Mar 6, 2026Updated 2 weeks ago
- ☆24Dec 2, 2023Updated 2 years ago
- ☆98Sep 25, 2025Updated 5 months ago
- OpusFilter - Parallel corpus processing toolkit☆115Feb 11, 2026Updated last month
- ☆26Jan 9, 2023Updated 3 years ago
- Constrained decoding utilities for text generation using Huggingface seq2seq models☆25Jan 25, 2023Updated 3 years ago
- Tool for comparison and evaluation of machine translation.☆56May 17, 2022Updated 3 years ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆22Apr 13, 2023Updated 2 years ago
- Lightweight self-hosted span annotation tool☆39Mar 12, 2026Updated last week
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Jan 19, 2024Updated 2 years ago
- ☆82Jan 30, 2026Updated last month
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆81Aug 31, 2021Updated 4 years ago
- Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.☆23Jun 23, 2023Updated 2 years ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆31Apr 8, 2024Updated last year
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Interactive tutorial on the Forward-Backward Expectation Maximization algorithm☆30Dec 15, 2015Updated 10 years ago
- ☆12May 10, 2017Updated 8 years ago
- Markdown-compatible AI-Powered Terminal Notepad☆14Apr 24, 2025Updated 10 months ago
- Appraise code used as part of WMT21 human evaluation campaign☆30Dec 15, 2025Updated 3 months ago
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Jun 13, 2024Updated last year
- Feature Decay Algorithms☆11Mar 5, 2014Updated 12 years ago
- Hot swap key mapping for macOS☆11Jan 18, 2026Updated 2 months ago
- Seed Machine Translation Data☆33Nov 12, 2024Updated last year
- ☆10May 28, 2024Updated last year
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆59Feb 10, 2025Updated last year