Helsinki-NLP / Opus-MT
Open neural machine translation models and web services
β622Updated last month
Related projects β
Alternatives and complementary repositories for Opus-MT
- Training open neural machine translation modelsβ336Updated 3 months ago
- Library for translating between 200 languages. Built on π€ transformers.β454Updated 2 months ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB teβ¦β251Updated last month
- Easy to use, state-of-the-art Neural Machine Translation for 100+ languagesβ1,172Updated 11 months ago
- A neural word aligner based on multilingual BERTβ328Updated 2 years ago
- Training scripts for Argos Translateβ121Updated this week
- Facebook Low Resource (FLoRes) MT Benchmarkβ704Updated last year
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.β230Updated 2 years ago
- Multilingual sentence alignment using sentence embeddingsβ101Updated 2 weeks ago
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.β153Updated 7 months ago
- Bitextor generates translation memories from multilingual websitesβ291Updated last week
- Tools to download and cleanup Common Crawl dataβ971Updated last year
- Fast Neural Machine Translation in C++β1,254Updated last year
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processingβ736Updated last month
- Improved Sentence Alignment in Linear Time and Spaceβ163Updated last year
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.β150Updated 5 months ago
- Open language modeling toolkit based on PyTorchβ61Updated this week
- State-of-the-art LLM-based translation models.β437Updated last month
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)β351Updated last year
- 80x faster and 95% accurate language identification with Fasttextβ141Updated 9 months ago
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisonsβ1,068Updated 3 months ago
- Crosslingual Generalization through Multitask Finetuningβ516Updated last month
- A Neural Framework for MT Evaluationβ508Updated 3 months ago
- β‘ boost inference speed of T5 models by 5x & reduce the model size by 3x.β566Updated last year
- β193Updated this week
- A tool that locates, downloads, and extracts machine translation corporaβ147Updated 5 months ago
- Meta's "No Language Left Behind" models served as web app and REST APIβ182Updated 3 months ago
- β1,252Updated last year
- BLEURT is a metric for Natural Language Generation based on transfer learning.β697Updated last year
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.β733Updated last week