masakhane-io / masakhane-mtView external linksLinks
Machine Translation for Africa
☆309Jun 14, 2022Updated 3 years ago
Alternatives and similar repositories for masakhane-mt
Users that are interested in masakhane-mt are comparing it to the libraries listed below
Sorting:
- A Simple Flask App to interact with your Machine Translation Model☆13Feb 26, 2020Updated 5 years ago
- Crosslingual Question Answering for African Languages☆30Sep 27, 2024Updated last year
- Agile reading group that works☆13Feb 2, 2022Updated 4 years ago
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Apr 26, 2024Updated last year
- Masakhane Web is a translation web application for solely African Languages.☆37Aug 11, 2023Updated 2 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆80May 31, 2022Updated 3 years ago
- ☆117Oct 15, 2025Updated 4 months ago
- Building an effective preprocessing tool for African languages☆13Jan 24, 2024Updated 2 years ago
- Minimalist NMT for educational purposes☆713Jan 29, 2024Updated 2 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆41Oct 13, 2022Updated 3 years ago
- MAFAND-MT☆60Jul 9, 2024Updated last year
- 🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 B…☆11Jun 8, 2021Updated 4 years ago
- Unsupervised Neural Machine Translation from West African Pidgin (Creole) to English without a single parallel sentence☆81Nov 23, 2020Updated 5 years ago
- Efficient Low-Memory Aligner☆146Jan 15, 2025Updated last year
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆16Jul 27, 2020Updated 5 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆162Sep 18, 2025Updated 4 months ago
- A guide to building language technology in new languages.☆60Feb 1, 2022Updated 4 years ago
- Facebook Low Resource (FLoRes) MT Benchmark☆762Nov 20, 2023Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Nov 30, 2022Updated 3 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆34Sep 4, 2025Updated 5 months ago
- This is a repository of scripts developed as part of the 2020 ENCMP100 Section B3 lecture taught at University of Alberta.☆10Apr 2, 2020Updated 5 years ago
- This repository contains source code for the paper "Language Model Prior for Low-Resource Neural Machine Translation"☆42Mar 16, 2021Updated 4 years ago
- ☆17Jan 12, 2023Updated 3 years ago
- Myanmar and Thai Language Resources☆10Jul 18, 2022Updated 3 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆24May 12, 2024Updated last year
- Cross-lingual Language Model (XLM) pretraining and Model-Agnostic Meta-Learning (MAML) for fast adaptation of deep networks☆20Mar 26, 2021Updated 4 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆102Jul 25, 2024Updated last year
- Towards developing a Robust Translation Model for African languages: Pilot Project FFR v1.0.☆44May 12, 2024Updated last year
- ☆18Oct 5, 2017Updated 8 years ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆23Nov 5, 2025Updated 3 months ago
- Source stories from the African Storybook Project in Markdown format☆22Jan 25, 2026Updated 3 weeks ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Feb 6, 2024Updated 2 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- Tool for manual evaluation of parallel sentences.☆15Jan 26, 2026Updated 3 weeks ago
- Open-Source Machine Translation Quality Estimation in PyTorch☆230Jun 23, 2022Updated 3 years ago
- ☆23May 12, 2024Updated last year
- ☆81Jan 30, 2026Updated 2 weeks ago
- A High-Quality Multilingual Dataset for Structured Documentation Translation☆37May 1, 2025Updated 9 months ago
- Yorùbá language training text for NLP, ASR and TTS tasks☆82Mar 3, 2023Updated 2 years ago