google/wmt-mqm-human-evaluation

☆100

Alternatives and similar repositories for wmt-mqm-human-evaluation

Users interested in wmt-mqm-human-evaluation are comparing it to the repositories listed below.

google-research / mt-metrics-eval
Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.
☆132 · Apr 23, 2026 · Updated 3 months ago
wmt-conference / wmt-format-tools
Tools for formatting WMT hypothesis and test sets in XML
☆27 · Apr 18, 2025 · Updated last year
wmt-conference / wmt21-news-systems
☆26 · Jan 9, 2023 · Updated 3 years ago
Unbabel / COMET
A Neural Framework for MT Evaluation
☆770 · Apr 21, 2026 · Updated 3 months ago
ZurichNLP / coverage-contrastive-conditioning
Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…
☆22 · Apr 13, 2023 · Updated 3 years ago
marzenakrp / LiteraryTranslation
☆24 · Apr 2, 2024 · Updated 2 years ago
sheffieldnlp / mlqe-pe
Multilingual Quality Estimation and Automatic Post-editing Dataset
☆44 · Mar 24, 2022 · Updated 4 years ago
wmt-conference / wmt22-news-systems
☆21 · Feb 13, 2023 · Updated 3 years ago
google-research / bleurt
BLEURT is a metric for Natural Language Generation based on transfer learning.
☆794 · Aug 4, 2023 · Updated 2 years ago
google / wmt19-paraphrased-references
☆15 · Nov 5, 2020 · Updated 5 years ago
wmt-conference / wmt25-general-mt
☆17 · Nov 19, 2025 · Updated 8 months ago
Unbabel / smaug
Python package to augment multilingual data
☆15 · Feb 15, 2023 · Updated 3 years ago
dayeonki / mt_feedback
Code for "Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations" [NAACL Findings 2024]
☆14 · Apr 3, 2026 · Updated 3 months ago
NLP2CT / UniTE
☆13 · Jan 30, 2023 · Updated 3 years ago
google-research / metricx
☆146 · Jul 2, 2026 · Updated 3 weeks ago
EleanorJiang / BlonDe
Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric …
☆85 · Sep 21, 2023 · Updated 2 years ago
Coldmist-Lu / ErrorAnalysis_Prompt
[ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPT
☆91 · Oct 14, 2025 · Updated 9 months ago
nitikam / tangled
Code, data, and additional analysis for the paper Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evalua…
☆15 · Aug 13, 2020 · Updated 5 years ago
Unbabel / OpenKiwi
Open-Source Machine Translation Quality Estimation in PyTorch
☆233 · Jun 23, 2022 · Updated 4 years ago
Smu-Tan / Remedy
[EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling
☆16 · Nov 20, 2025 · Updated 8 months ago
hsing-wang / Awesome-LLM-MT
☆254 · May 30, 2024 · Updated 2 years ago
AppraiseDev / OCELoT
Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations
☆23 · Jul 11, 2026 · Updated 2 weeks ago
lucadiliello / bleurt-pytorch
BLEURT implementation in PyTorch
☆38 · Jan 19, 2023 · Updated 3 years ago
longyuewangdcu / Document-MT-LLM
☆101 · May 2, 2023 · Updated 3 years ago
AppraiseDev / Appraise
Appraise code used as part of WMT21 human evaluation campaign
☆30 · Jul 15, 2026 · Updated last week
facebookresearch / mlqe
We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…
☆81 · Aug 31, 2021 · Updated 4 years ago
duyichao / NPDA-KNN-ST
Official implementation of EMNLP'2022 paper "Non-Parametric Domain Adaptation for End-to-End Speech Translation"
☆11 · Oct 26, 2022 · Updated 3 years ago
TharinduDR / TransQuest
Transformer based translation quality estimation
☆114 · Jul 20, 2023 · Updated 3 years ago
zwhe99 / SelfTraining4UNMT
[ACL 2022] Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation
☆31 · Oct 6, 2023 · Updated 2 years ago
Helsinki-NLP / OpusFilter
OpusFilter - Parallel corpus processing toolkit
☆115 · Jul 1, 2026 · Updated 3 weeks ago
Coldmist-Lu / MQM_APE
[MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.
☆12 · Sep 24, 2024 · Updated last year
bnewm0609 / arxivDIGESTables
☆18 · Sep 15, 2025 · Updated 10 months ago
Yale-LILY / SummEval
Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper
☆415 · Jun 23, 2024 · Updated 2 years ago
Unbabel / word-level-qe-corpus-builder
Builds a WMT18-like corpus for word-level QE with annotations in the source and target words.
☆10 · Sep 19, 2022 · Updated 3 years ago
cfedermann / Appraise
Appraise evaluation system for manual evaluation of machine translation output
☆77 · May 7, 2021 · Updated 5 years ago
sunzewei2715 / Doc2Doc_NMT
The repository for the paper: Rethinking Document-level Neural Machine Translation
☆25 · Dec 20, 2022 · Updated 3 years ago
rbawden / discourse-mt-test-sets
☆29 · Jun 10, 2024 · Updated 2 years ago
MicrosoftTranslator / NTREX
NTREX -- News Test References for MT Evaluation
☆87 · Jun 5, 2024 · Updated 2 years ago
jungokasai / THumB
☆15 · Apr 8, 2022 · Updated 4 years ago