Translation Error Rate (TER)
☆45May 25, 2018Updated 7 years ago
Alternatives and similar repositories for tercom
Users that are interested in tercom are comparing it to the libraries listed below
Sorting:
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆81Aug 31, 2021Updated 4 years ago
- Multilingual Quality Estimation and Automatic Post-editing Dataset☆42Mar 24, 2022Updated 3 years ago
- Builds a WMT18-like corpus for word-level QE with annotations in the source and target words.☆10Sep 19, 2022Updated 3 years ago
- Pipelined quality estimation.☆51Aug 13, 2019Updated 6 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Dec 19, 2023Updated 2 years ago
- scripts and configuration files for Edinburgh neural MT submission to WMT 16 shared translation task☆138Nov 5, 2020Updated 5 years ago
- ☆34Nov 22, 2021Updated 4 years ago
- Dynamic data selection for neural machine translation☆20Jan 28, 2018Updated 8 years ago
- Simple, fast unsupervised word aligner☆767Jul 19, 2022Updated 3 years ago
- Domain Adaptation of Neural Machine Translation by Lexicon Induction☆20Jan 3, 2020Updated 6 years ago
- ☆27Jan 7, 2017Updated 9 years ago
- Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better …☆204Feb 25, 2023Updated 3 years ago
- ☆23Feb 4, 2020Updated 6 years ago
- Framework for neural-based Quality Estimation☆41Sep 23, 2020Updated 5 years ago
- Collection of Evaluation Metrics and Algorithms for Machine Translation☆76Mar 5, 2018Updated 7 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- ☆26Jan 9, 2023Updated 3 years ago
- WMT-2012 shared task on Quality Estimation☆18Sep 5, 2012Updated 13 years ago
- Code for "On Long-Tailed Phenomena in NMT".☆10Jan 10, 2021Updated 5 years ago
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP☆14Mar 24, 2021Updated 4 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Oct 14, 2022Updated 3 years ago
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 3 years ago
- ☆31Jun 13, 2019Updated 6 years ago
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,227Jan 12, 2026Updated last month
- ☆20Dec 16, 2024Updated last year
- KiwiCutter is a simple introduction to using OpenKiwi☆13Dec 8, 2022Updated 3 years ago
- Human evaluation results and translation output for the Translator Human Parity Data release☆37Mar 19, 2018Updated 7 years ago
- Code examples for CMU CS11-731, Machine Translation and Sequence-to-sequence Models☆35Nov 4, 2019Updated 6 years ago
- machine translation and quality estimation☆35Jan 13, 2019Updated 7 years ago
- Scripts to preprocess training and test data and to run fast_align and giza☆107Nov 2, 2021Updated 4 years ago
- Document-Level Neural Machine Translation with Hierarchical Attention Networks☆67May 9, 2022Updated 3 years ago
- Transformer based translation quality estimation☆114Jul 20, 2023Updated 2 years ago
- An educational tool to train, inspect, evaluate and translate using neural engines☆19Mar 13, 2025Updated 11 months ago
- [In-Progress] A command-line tool for Neural Machine Translation in Python & Tensorflow☆16Jan 1, 2017Updated 9 years ago
- ☆18Apr 2, 2021Updated 4 years ago
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆44Jul 9, 2022Updated 3 years ago
- A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training.☆166May 12, 2021Updated 4 years ago