facebookresearch/mlqe

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/mlqe)

facebookresearch / mlqe

We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scale 1 to 100) generated though human evaluations that represent the quality of the translations.Paper Title Unsupervised Quality Estimation for Neural Machine Translation

☆81

Alternatives and similar repositories for mlqe

Users that are interested in mlqe are comparing it to the libraries listed below

Sorting:

sheffieldnlp / mlqe-pe
View on GitHub
Multilingual Quality Estimation and Automatic Post-editing Dataset
☆42Mar 24, 2022Updated 3 years ago
TharinduDR / TransQuest
View on GitHub
Transformer based translation quality estimation
☆114Jul 20, 2023Updated 2 years ago
Unbabel / OpenKiwi
View on GitHub
Open-Source Machine Translation Quality Estimation in PyTorch
☆232Jun 23, 2022Updated 3 years ago
sheffieldnlp / deepQuest
View on GitHub
Framework for neural-based Quality Estimation
☆41Sep 23, 2020Updated 5 years ago
jhclark / tercom
View on GitHub
Translation Error Rate (TER)
☆45May 25, 2018Updated 7 years ago
Unbabel / word-level-qe-corpus-builder
View on GitHub
Builds a WMT18-like corpus for word-level QE with annotations in the source and target words.
☆10Sep 19, 2022Updated 3 years ago
salesforce / localization-xml-mt
View on GitHub
A High-Quality Multilingual Dataset for Structured Documentation Translation
☆37May 1, 2025Updated 10 months ago
lilt / alignment-scripts
View on GitHub
Scripts to preprocess training and test data and to run fast_align and giza
☆107Nov 2, 2021Updated 4 years ago
JunjieHu / dali
View on GitHub
Domain Adaptation of Neural Machine Translation by Lexicon Induction
☆20Jan 3, 2020Updated 6 years ago
Unbabel / KiwiCutter
View on GitHub
KiwiCutter is a simple introduction to using OpenKiwi
☆13Dec 8, 2022Updated 3 years ago
wangqiangneu / dlcl
View on GitHub
The implementation of "Learning Deep Transformer Models for Machine Translation"
☆116Jul 25, 2024Updated last year
ghpaetzold / questplusplus
View on GitHub
Pipelined quality estimation.
☆51Aug 13, 2019Updated 6 years ago
wmt-conference / wmt21-news-systems
View on GitHub
☆26Jan 9, 2023Updated 3 years ago
lovecambi / qebrain
View on GitHub
machine translation and quality estimation
☆35Jan 13, 2019Updated 7 years ago
bitextor / bicleaner
View on GitHub
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
☆160Jun 18, 2024Updated last year
qe-team / marmot
View on GitHub
MARMOT - the open source framework for feature extraction and machine learning, designed to estimate the quality of Machine Translation o…
☆22Oct 29, 2017Updated 8 years ago
luismsgomes / mosestokenizer
View on GitHub
☆20Oct 22, 2021Updated 4 years ago
AppraiseDev / OCELoT
View on GitHub
Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations
☆23Nov 5, 2025Updated 3 months ago
zerocstaker / constrained_ape
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆12Oct 10, 2020Updated 5 years ago
lspecia / QualityEstimation
View on GitHub
WMT-2012 shared task on Quality Estimation
☆18Sep 5, 2012Updated 13 years ago
eval4nlp / SharedTask2021
View on GitHub
☆17Nov 23, 2021Updated 4 years ago
clab / fast_align
View on GitHub
Simple, fast unsupervised word aligner
☆767Jul 19, 2022Updated 3 years ago
wxjiao / Data-Rejuvenation
View on GitHub
Implementation of our paper "Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation" in EMNLP-2020.
☆23Aug 20, 2021Updated 4 years ago
facebookresearch / evaluation-of-nmt-bt
View on GitHub
This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets …
☆15Aug 31, 2021Updated 4 years ago
THUNLP-MT / L2Copy4APE
View on GitHub
Learning to Copy for Automatic Post-Editing (EMNLP 2019)
☆11May 6, 2021Updated 4 years ago
sameenmaruf / selective-attn
View on GitHub
Data and code used in our NAACL'19 paper "Selective Attention for Context-aware Neural Machine Translation"
☆30Apr 12, 2020Updated 5 years ago
xlniu / Quality-Estimation1
View on GitHub
机器翻译子任务-翻译质量评价-复现 WMT2018 阿里论文结果
☆20Mar 12, 2019Updated 6 years ago
hplt-project / sacremoses
View on GitHub
Python port of Moses tokenizer, truecaser and normalizer
☆495Feb 6, 2026Updated 3 weeks ago
lena-voita / good-translation-wrong-in-context
View on GitHub
This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …
☆98May 12, 2020Updated 5 years ago
ShannonAI / fast-knn-nmt
View on GitHub
☆33Oct 1, 2021Updated 4 years ago
rbawden / discourse-mt-test-sets
View on GitHub
☆29Jun 10, 2024Updated last year
seilna / CNN-Units-in-NLP
View on GitHub
Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs
☆26Mar 9, 2019Updated 6 years ago
Helsinki-NLP / OpusFilter
View on GitHub
OpusFilter - Parallel corpus processing toolkit
☆115Feb 11, 2026Updated 2 weeks ago
SYSTRAN / fuzzy-match
View on GitHub
Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.
☆51Apr 22, 2025Updated 10 months ago
AppraiseDev / Appraise
View on GitHub
Appraise code used as part of WMT21 human evaluation campaign
☆30Dec 15, 2025Updated 2 months ago
google / wmt-mqm-human-evaluation
View on GitHub
☆98Sep 25, 2025Updated 5 months ago
Unbabel / MT-Telescope
View on GitHub
☆34Nov 22, 2021Updated 4 years ago
AIPHES / DiscoScore
View on GitHub
DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence
☆36Jul 25, 2023Updated 2 years ago
bert-nmt / bert-nmt
View on GitHub
☆361Nov 22, 2022Updated 3 years ago