Helsinki-NLP / Opus-MT
Open neural machine translation models and web services
☆665Updated 3 months ago
Alternatives and similar repositories for Opus-MT:
Users that are interested in Opus-MT are comparing it to the libraries listed below
- Training open neural machine translation models☆354Updated 6 months ago
- Fast Neural Machine Translation in C++☆1,300Updated last year
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.☆160Updated 10 months ago
- Bitextor generates translation memories from multilingual websites☆291Updated 4 months ago
- A neural word aligner based on multilingual BERT☆339Updated 3 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆267Updated last month
- Fast inference engine for Transformer models☆3,659Updated 2 weeks ago
- Easy to use, state-of-the-art Neural Machine Translation for 100+ languages☆1,214Updated last year
- BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages☆221Updated last year
- Facebook Low Resource (FLoRes) MT Benchmark☆722Updated last year
- A Neural Framework for MT Evaluation☆551Updated 2 months ago
- State-of-the-art LLM-based translation models.☆500Updated last month
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆359Updated last year
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,109Updated 2 months ago
- Fast and customizable text tokenization library with BPE and SentencePiece support☆302Updated 6 months ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆241Updated 2 years ago
- A list of awesome Machine Translation frameworks, libraries, software and papers☆189Updated 7 months ago
- Library for translating between 200 languages. Built on 🤗 transformers.☆473Updated 6 months ago
- NeuSpell: A Neural Spelling Correction Toolkit☆690Updated last year
- Tools to download and cleanup Common Crawl data☆992Updated last year
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f…☆208Updated 4 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆155Updated 8 months ago
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.☆878Updated last week
- Crosslingual Generalization through Multitask Finetuning☆529Updated 5 months ago
- Improved Sentence Alignment in Linear Time and Space☆168Updated 2 years ago
- Meta's "No Language Left Behind" models served as web app and REST API☆204Updated 6 months ago
- Training scripts for Argos Translate☆128Updated 3 months ago
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆574Updated last year
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing☆747Updated 5 months ago
- The FLORES+ Machine Translation Benchmark☆101Updated 4 months ago