Training open neural machine translation models
☆401 · Jan 17, 2026 · Updated last month
Alternatives and similar repositories for OPUS-MT-train
Users interested in OPUS-MT-train compare it to the repositories listed below.
- Open neural machine translation models and web services · ☆777 · Feb 23, 2026 · Updated last week
- ☆846 · Aug 20, 2024 · Updated last year
- Fast Neural Machine Translation in C++ · ☆1,426 · Aug 25, 2023 · Updated 2 years ago
- OpusFilter - Parallel corpus processing toolkit · ☆115 · Feb 11, 2026 · Updated 3 weeks ago
- (NAACL 2024) Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations · ☆15 · Apr 14, 2025 · Updated 10 months ago
- Translation demonstrator · ☆37 · May 12, 2020 · Updated 5 years ago
- Easy to use, state-of-the-art Neural Machine Translation for 100+ languages · ☆1,253 · Dec 21, 2023 · Updated 2 years ago
- Collection of Common Machine Translation Tools · ☆11 · Jul 26, 2022 · Updated 3 years ago
- ☆82 · Jan 30, 2026 · Updated last month
- Training scripts for Argos Translate · ☆154 · Jan 18, 2026 · Updated last month
- Fast inference engine for Transformer models · ☆4,342 · Feb 4, 2026 · Updated last month
- Open information and community for machine translation · ☆81 · Updated this week
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) · ☆390 · Nov 7, 2023 · Updated 2 years ago
- Facebook Low Resource (FLoRes) MT Benchmark · ☆766 · Nov 20, 2023 · Updated 2 years ago
- ☆21 · Feb 13, 2023 · Updated 3 years ago
- Improved Sentence Alignment in Linear Time and Space · ☆192 · Mar 6, 2023 · Updated 3 years ago
- Scripts to preprocess training and test data and to run fast_align and giza · ☆107 · Nov 2, 2021 · Updated 4 years ago
- Open-Source Machine Translation Quality Estimation in PyTorch · ☆232 · Jun 23, 2022 · Updated 3 years ago
- Toolkit for training/converting LibreTranslate compatible language models 🚂 · ☆77 · Jun 23, 2025 · Updated 8 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. · ☆160 · Jun 18, 2024 · Updated last year
- Transfer learning for neural machine translation using cross-lingual word embeddings · ☆10 · Dec 17, 2025 · Updated 2 months ago
- Efficient Low-Memory Aligner · ☆146 · Jan 15, 2025 · Updated last year
- A neural word aligner based on multilingual BERT · ☆373 · Mar 10, 2022 · Updated 3 years ago
- 🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 B… · ☆11 · Jun 8, 2021 · Updated 4 years ago
- Example of building a working Spanish-to-English translation model with Marian NMT · ☆23 · May 3, 2020 · Updated 5 years ago
- Python port of Moses tokenizer, truecaser and normalizer · ☆495 · Feb 6, 2026 · Updated last month
- Open language modeling toolkit based on PyTorch · ☆176 · Updated this week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. · ☆32,170 · Sep 30, 2025 · Updated 5 months ago
- NMT with ssp · ☆11 · Oct 28, 2021 · Updated 4 years ago
- Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation" · ☆11 · Jul 13, 2017 · Updated 8 years ago
- Language-Agnostic SEntence Representations · ☆3,659 · May 2, 2024 · Updated last year
- OPUS-CAT is a collection of software which makes it possible to use OPUS-MT neural machine translation models in professional translation. OPU… · ☆84 · Feb 4, 2025 · Updated last year
- Bitextor generates translation memories from multilingual websites · ☆301 · Nov 11, 2024 · Updated last year
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021 · ☆61 · May 10, 2021 · Updated 4 years ago
- A tool that locates, downloads, and extracts machine translation corpora · ☆162 · Sep 18, 2025 · Updated 5 months ago
- Unsupervised text tokenizer for Neural Network-based text generation. · ☆11,677 · Updated this week
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES · ☆16 · Jul 27, 2020 · Updated 5 years ago
- ☆31 · Jun 28, 2022 · Updated 3 years ago
- Open Source Neural Machine Translation and (Large) Language Models in PyTorch · ☆6,997 · Oct 14, 2025 · Updated 4 months ago