maidis / awesome-machine-translation
A list of awesome Machine Translation frameworks, libraries, software and papers
☆197 · Jul 15, 2024 · Updated last year
Alternatives and similar repositories for awesome-machine-translation
Users who are interested in awesome-machine-translation are comparing it to the libraries listed below.
- ACL Paper Lists (machine translation) ☆13 · Mar 23, 2022 · Updated 3 years ago
- Collection of Common Machine Translation Tools ☆11 · Jul 26, 2022 · Updated 3 years ago
- ☆10 · Mar 22, 2024 · Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 · Oct 12, 2022 · Updated 3 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks. ☆41 · Dec 19, 2023 · Updated 2 years ago
- ☆34 · Nov 22, 2021 · Updated 4 years ago
- Speech Denoising project for the Deep Learning course at Tsinghua University, spring 2021 ☆18 · Sep 22, 2021 · Updated 4 years ago
- OpusFilter - Parallel corpus processing toolkit ☆115 · Updated this week
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations ☆23 · Nov 5, 2025 · Updated 3 months ago
- PyTorch implementation of NMT models along with custom tokenizers, models, and datasets ☆21 · Aug 1, 2022 · Updated 3 years ago
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista… ☆25 · Updated this week
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency … ☆10 · Jul 18, 2023 · Updated 2 years ago
- ☆10 · Oct 21, 2022 · Updated 3 years ago
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment. ☆175 · Dec 28, 2025 · Updated last month
- Local cross-platform machine translation GUI, based on CTranslate2 ☆99 · Jan 9, 2024 · Updated 2 years ago
- A library of translation-based text similarity measures ☆25 · Dec 11, 2023 · Updated 2 years ago
- Translation Error Rate (TER) ☆45 · May 25, 2018 · Updated 7 years ago
- ☆254 · May 30, 2024 · Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆16 · Dec 3, 2024 · Updated last year
- ParCourE - Parallel Corpus Explorer ☆12 · Dec 27, 2021 · Updated 4 years ago
- ☆14 · Aug 16, 2023 · Updated 2 years ago
- Efficient Low-Memory Aligner ☆146 · Jan 15, 2025 · Updated last year
- Tool to fix bitexts and tag near-duplicates for removal ☆34 · Sep 4, 2025 · Updated 5 months ago
- NTREX -- News Test References for MT Evaluation ☆88 · Jun 5, 2024 · Updated last year
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko… ☆12 · Aug 24, 2021 · Updated 4 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation ☆15 · Aug 27, 2024 · Updated last year
- Adaptive Machine Translation with Large Language Models ☆32 · Jan 4, 2025 · Updated last year
- This repository contains multi-modal speech data for African languages that can be used to train ASR and NLP models ☆16 · Aug 31, 2022 · Updated 3 years ago
- A tool that locates, downloads, and extracts machine translation corpora ☆162 · Sep 18, 2025 · Updated 4 months ago
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f… ☆226 · Nov 12, 2024 · Updated last year
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉 ☆12 · Jul 19, 2023 · Updated 2 years ago
- ☆16 · Sep 28, 2023 · Updated 2 years ago
- This repository is used to publish our codes for the conference paper "Vietnamese punctuation prediction using deep neural networks" at S… ☆11 · Jul 11, 2020 · Updated 5 years ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is} ☆14 · Jun 18, 2023 · Updated 2 years ago
- This code helps to retrieve all papers from conferences and rank them by the number of (Google Scholar) citations. ☆12 · Dec 12, 2021 · Updated 4 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR ☆15 · Dec 4, 2021 · Updated 4 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021 ☆61 · May 10, 2021 · Updated 4 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages. ☆35 · Feb 5, 2026 · Updated last week
- Bilingual term extractor ☆59 · Nov 19, 2025 · Updated 2 months ago