We release a dataset of Wikipedia sentences and their translations in 6 different languages, along with human-evaluation scores (on a scale of 1 to 100) that represent the quality of the translations. Paper title: Unsupervised Quality Estimation for Neural Machine Translation
☆81, Aug 31, 2021 (updated 4 years ago)
Alternatives and similar repositories for mlqe
Users that are interested in mlqe are comparing it to the libraries listed below.
- Multilingual Quality Estimation and Automatic Post-editing Dataset ☆43, Mar 24, 2022 (updated 4 years ago)
- Transformer based translation quality estimation ☆114, Jul 20, 2023 (updated 2 years ago)
- Framework for neural-based Quality Estimation ☆41, Sep 23, 2020 (updated 5 years ago)
- Open-Source Machine Translation Quality Estimation in PyTorch ☆233, Jun 23, 2022 (updated 3 years ago)
- Builds a WMT18-like corpus for word-level QE with annotations in the source and target words. ☆10, Sep 19, 2022 (updated 3 years ago)
- Translation Error Rate (TER) ☆45, May 25, 2018 (updated 7 years ago)
- KiwiCutter is a simple introduction to using OpenKiwi ☆13, Dec 8, 2022 (updated 3 years ago)
- A High-Quality Multilingual Dataset for Structured Documentation Translation ☆37, May 1, 2025 (updated 11 months ago)
- Evaluation scripts for the 2019 machine translation quality estimation shared task ☆12, Mar 27, 2019 (updated 7 years ago)
- ☆17, Nov 23, 2021 (updated 4 years ago)
- MARMOT - the open source framework for feature extraction and machine learning, designed to estimate the quality of Machine Translation o… ☆22, Oct 29, 2017 (updated 8 years ago)
- Scripts to preprocess training and test data and to run fast_align and giza ☆107, Nov 2, 2021 (updated 4 years ago)
- WMT-2012 shared task on Quality Estimation ☆18, Sep 5, 2012 (updated 13 years ago)
- Pipelined quality estimation. ☆51, Aug 13, 2019 (updated 6 years ago)
- Domain Adaptation of Neural Machine Translation by Lexicon Induction ☆20, Jan 3, 2020 (updated 6 years ago)
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆160, Jun 18, 2024 (updated last year)
- machine translation and quality estimation ☆35, Jan 13, 2019 (updated 7 years ago)
- ☆26, Jan 9, 2023 (updated 3 years ago)
- This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets … ☆15, Aug 31, 2021 (updated 4 years ago)
- Simple, fast unsupervised word aligner ☆770, Jul 19, 2022 (updated 3 years ago)
- The implementation of "Learning Deep Transformer Models for Machine Translation" ☆116, Jul 25, 2024 (updated last year)
- ☆20, Oct 22, 2021 (updated 4 years ago)
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence ☆37, Jul 25, 2023 (updated 2 years ago)
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations ☆23, Nov 5, 2025 (updated 5 months ago)
- Learning to Copy for Automatic Post-Editing (EMNLP 2019) ☆11, May 6, 2021 (updated 4 years ago)
- Python port of Moses tokenizer, truecaser and normalizer ☆494, Feb 6, 2026 (updated 2 months ago)
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆12, Oct 10, 2020 (updated 5 years ago)
- Reference-free MT Evaluation Metrics ☆20, Sep 24, 2022 (updated 3 years ago)
- Code for "Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations" [NAACL Findings 2024] ☆15, Apr 3, 2026 (updated last week)
- ☆29, Jun 10, 2024 (updated last year)
- ☆14, Feb 3, 2021 (updated 5 years ago)
- Data and code used in our NAACL'19 paper "Selective Attention for Context-aware Neural Machine Translation" ☆30, Apr 12, 2020 (updated 6 years ago)
- bin files ☆13, Jan 30, 2025 (updated last year)
- Implementation of our paper "Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation" in EMNLP-2020. ☆23, Aug 20, 2021 (updated 4 years ago)
- Appraise code used as part of WMT21 human evaluation campaign ☆30 (updated this week)
- Machine translation subtask, translation quality estimation: reproduces the results of Alibaba's WMT2018 paper ☆20, Mar 12, 2019 (updated 7 years ago)
- OpusFilter - Parallel corpus processing toolkit ☆115 (updated this week)
- Best Practices in Translation Memory Management ☆47, Dec 14, 2018 (updated 7 years ago)
- Extend bert-nmt to context-aware translation. ☆11, May 24, 2021 (updated 4 years ago)