maidis / awesome-machine-translation
A list of awesome Machine Translation frameworks, libraries, software and papers
☆187 · Updated 7 months ago
Alternatives and similar repositories for awesome-machine-translation
Users who are interested in awesome-machine-translation are comparing it to the repositories listed below:
- Multilingual sentence alignment using sentence embeddings · ☆108 · Updated 3 months ago
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment. · ☆157 · Updated 10 months ago
- OpusFilter - Parallel corpus processing toolkit · ☆104 · Updated 3 weeks ago
- Open information and community for machine translation · ☆73 · Updated last week
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. · ☆154 · Updated 8 months ago
- Open language modeling toolkit based on PyTorch · ☆84 · Updated last week
- Improved Sentence Alignment in Linear Time and Space · ☆165 · Updated last year
- The Open Parallel Corpus · ☆64 · Updated this week
- The FLORES+ Machine Translation Benchmark · ☆100 · Updated 3 months ago
- Bicleaner fork that uses neural networks · ☆39 · Updated 6 months ago
- A tool that locates, downloads, and extracts machine translation corpora · ☆150 · Updated 8 months ago
- ☆239 · Updated 8 months ago
- Machine Translation (MT) Preparation Scripts · ☆31 · Updated 2 weeks ago
- Translation demonstrator · ☆31 · Updated 4 years ago
- Local cross-platform machine translation GUI, based on CTranslate2 · ☆90 · Updated last year
- A neural word aligner based on multilingual BERT · ☆338 · Updated 2 years ago
- Machine-Translation-based sentence alignment tool for parallel text · ☆306 · Updated 3 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models. · ☆49 · Updated last month
- NTREX -- News Test References for MT Evaluation · ☆81 · Updated 8 months ago
- Sentence aligner · ☆109 · Updated 3 years ago
- Bilingual term extractor · ☆53 · Updated last year
- Machine Translation Web Interface for OpenNMT-py · ☆25 · Updated 3 years ago
- Efficient Low-Memory Aligner · ☆141 · Updated last month
- ☆71 · Updated 2 weeks ago
- ☆200 · Updated last month
- cLang-8 is a dataset for grammatical error correction. · ☆103 · Updated 2 years ago
- Adaptive Machine Translation with Large Language Models · ☆30 · Updated last month
- A Python package for deep multilingual punctuation prediction. · ☆115 · Updated 6 months ago
- Universal Romanizer that can convert any Unicode script to Roman (Latin) script · ☆177 · Updated 6 months ago
- State-of-the-art LLM-based translation models. · ☆486 · Updated 3 weeks ago