Machine translation (MT) benchmark dataset for languages in the Horn of Africa.
☆45Oct 13, 2022Updated 3 years ago
Alternatives and similar repositories for HornMT
Users that are interested in HornMT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover , Lexical analyzer, Corpus i…☆37May 27, 2023Updated 2 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- A simple program for handling Ethiopian calendar dates.☆30Dec 26, 2025Updated 4 months ago
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities☆17Jun 4, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 B…☆11Jun 8, 2021Updated 4 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Aug 27, 2024Updated last year
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆38Oct 14, 2025Updated 6 months ago
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆13Aug 15, 2022Updated 3 years ago
- African made ERC20 Ethereum Token☆11Aug 25, 2021Updated 4 years ago
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆16Jul 27, 2020Updated 5 years ago
- Machine Translation for Africa☆314Jun 14, 2022Updated 3 years ago
- Chapa API for Java based web apps☆13Jun 9, 2024Updated last year
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆19Mar 26, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Dec 8, 2022Updated 3 years ago
- A collection of textual datasets in Hausa language and the corresponding translation in English language.