We release a dataset of Wikipedia sentences and their translations in 6 different languages, along with scores (on a scale of 1 to 100) generated through human evaluations that represent the quality of the translations. Paper title: Unsupervised Quality Estimation for Neural Machine Translation
☆81 · Aug 31, 2021 · Updated 4 years ago
Alternatives and similar repositories for mlqe
Users who are interested in mlqe are comparing it to the libraries listed below.
- Multilingual Quality Estimation and Automatic Post-editing Dataset · ☆42 · Mar 24, 2022 · Updated 3 years ago
- Transformer based translation quality estimation · ☆114 · Jul 20, 2023 · Updated 2 years ago
- Open-Source Machine Translation Quality Estimation in PyTorch · ☆232 · Jun 23, 2022 · Updated 3 years ago
- Framework for neural-based Quality Estimation · ☆41 · Sep 23, 2020 · Updated 5 years ago
- Translation Error Rate (TER) · ☆45 · May 25, 2018 · Updated 7 years ago
- Builds a WMT18-like corpus for word-level QE with annotations in the source and target words. · ☆10 · Sep 19, 2022 · Updated 3 years ago
- A High-Quality Multilingual Dataset for Structured Documentation Translation · ☆37 · May 1, 2025 · Updated 10 months ago
- Scripts to preprocess training and test data and to run fast_align and giza · ☆107 · Nov 2, 2021 · Updated 4 years ago
- Domain Adaptation of Neural Machine Translation by Lexicon Induction · ☆20 · Jan 3, 2020 · Updated 6 years ago
- KiwiCutter is a simple introduction to using OpenKiwi · ☆13 · Dec 8, 2022 · Updated 3 years ago
- The implementation of "Learning Deep Transformer Models for Machine Translation" · ☆116 · Jul 25, 2024 · Updated last year
- Pipelined quality estimation. · ☆51 · Aug 13, 2019 · Updated 6 years ago
- ☆26 · Jan 9, 2023 · Updated 3 years ago
- machine translation and quality estimation · ☆35 · Jan 13, 2019 · Updated 7 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. · ☆160 · Jun 18, 2024 · Updated last year
- MARMOT - the open source framework for feature extraction and machine learning, designed to estimate the quality of Machine Translation o… · ☆22 · Oct 29, 2017 · Updated 8 years ago
- ☆20 · Oct 22, 2021 · Updated 4 years ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations · ☆23 · Nov 5, 2025 · Updated 3 months ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. · ☆12 · Oct 10, 2020 · Updated 5 years ago
- WMT-2012 shared task on Quality Estimation · ☆18 · Sep 5, 2012 · Updated 13 years ago
- ☆17 · Nov 23, 2021 · Updated 4 years ago
- Simple, fast unsupervised word aligner · ☆767 · Jul 19, 2022 · Updated 3 years ago
- Implementation of our paper "Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation" in EMNLP-2020. · ☆23 · Aug 20, 2021 · Updated 4 years ago
- This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets … · ☆15 · Aug 31, 2021 · Updated 4 years ago
- Learning to Copy for Automatic Post-Editing (EMNLP 2019) · ☆11 · May 6, 2021 · Updated 4 years ago
- Data and code used in our NAACL'19 paper "Selective Attention for Context-aware Neural Machine Translation" · ☆30 · Apr 12, 2020 · Updated 5 years ago
- Machine translation subtask, translation quality estimation: reproduces the results of Alibaba's WMT2018 paper · ☆20 · Mar 12, 2019 · Updated 6 years ago
- Python port of Moses tokenizer, truecaser and normalizer · ☆495 · Feb 6, 2026 · Updated 3 weeks ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 … · ☆98 · May 12, 2020 · Updated 5 years ago
- ☆33 · Oct 1, 2021 · Updated 4 years ago
- ☆29 · Jun 10, 2024 · Updated last year
- Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs · ☆26 · Mar 9, 2019 · Updated 6 years ago
- OpusFilter - Parallel corpus processing toolkit · ☆115 · Feb 11, 2026 · Updated 2 weeks ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target. · ☆51 · Apr 22, 2025 · Updated 10 months ago
- Appraise code used as part of WMT21 human evaluation campaign · ☆30 · Dec 15, 2025 · Updated 2 months ago
- ☆98 · Sep 25, 2025 · Updated 5 months ago
- ☆34 · Nov 22, 2021 · Updated 4 years ago
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence · ☆36 · Jul 25, 2023 · Updated 2 years ago
- ☆361 · Nov 22, 2022 · Updated 3 years ago