asmelashteka/HornMT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/asmelashteka/HornMT)

asmelashteka / HornMT

Machine translation (MT) benchmark dataset for languages in the Horn of Africa.

☆46

Alternatives and similar repositories for HornMT

Users that are interested in HornMT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

masakhane-io / africomet
View on GitHub
COMET for African languages
☆11Jan 24, 2025Updated last year
ARBML / dar
View on GitHub
A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.
☆11Jun 23, 2024Updated 2 years ago
Andrews2017 / africanlp-public-datasets
View on GitHub
A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.
☆117Apr 26, 2024Updated 2 years ago
YerevaNN / PARASITE
View on GitHub
🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 B…
☆11Jun 8, 2021Updated 5 years ago
shyyhs / CourseraParallelCorpusMining
View on GitHub
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
☆15Aug 27, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
hausanlp / NaijaSenti
View on GitHub
This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…
☆39Oct 14, 2025Updated 9 months ago
ehsanasgari / 1000Langs
View on GitHub
Creating super-parallel corpora of more than 1500+ unique languages for NLP research
☆33Dec 8, 2022Updated 3 years ago
ijdutse / hausa-corpus
View on GitHub
A collection of textual datasets in Hausa language and the corresponding translation in English language.
☆19Mar 5, 2021Updated 5 years ago
Neurotech-HQ / python-dpo
View on GitHub
A python package to easy the integration with Direct Online Pay (Mpesa, TigoPesa, AirtelMoney, Card Payments)
☆20Nov 19, 2021Updated 4 years ago
kevindegila / flask-joey
View on GitHub
A Simple Flask App to interact with your Machine Translation Model
☆13Feb 26, 2020Updated 6 years ago
mzboito / IWSLT2022_Tamasheq_data
View on GitHub
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…
☆18Nov 30, 2022Updated 3 years ago
global-asp / asp-source
View on GitHub
Source stories from the African Storybook Project in Markdown format
☆22Jan 25, 2026Updated 6 months ago
pluiez / NLLB-inference
View on GitHub
☆56Jul 16, 2022Updated 4 years ago
masakhane-io / afriqa
View on GitHub
Crosslingual Question Answering for African Languages
☆31Sep 27, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lykmapipo / tz-mpesa-ussd-push
View on GitHub
Vodacom Tanzania USSD Push API Client
☆28Dec 10, 2022Updated 3 years ago
machinetranslate / machinetranslate.org
View on GitHub
Open information and community for machine translation
☆81Jul 20, 2026Updated last week
ghrua / NgramRes
View on GitHub
☆23Nov 6, 2022Updated 3 years ago
naver / nllb-pruning
View on GitHub
Library for pruning experts per language pair in NLLB-200
☆35Jul 7, 2023Updated 3 years ago
masakhane-io / lafand-mt
View on GitHub
MAFAND-MT
☆63Jul 9, 2024Updated 2 years ago
lspecia / quest
View on GitHub
Pascal2 Harvest project QuEst
☆14Sep 15, 2014Updated 11 years ago
imcohen / segment-brain-mri
View on GitHub
Brain MRI segmentation using Kaggle dataset
☆14Apr 21, 2021Updated 5 years ago
oya163 / nepali-ner
View on GitHub
Named Entity Recognition in Nepali Language
☆10Jan 12, 2023Updated 3 years ago
luismond / tm2tb
View on GitHub
Bilingual term extractor
☆60Nov 19, 2025Updated 8 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
beem-africa / python-client
View on GitHub
A Python library to ease the integration with the Beem Africa (SMS, AIRTIME, OTP, 2WAY-SMS, BPAY, USSD)
☆23Jan 14, 2024Updated 2 years ago
vincent-laizer / NECTA-API
View on GitHub
A python package to fetch results of various national examinations done in Tanzania.
☆26Nov 24, 2024Updated last year
dsridhar91 / hstm
View on GitHub
Code and data for "Heterogeneous Supervised Topic Models"
☆10Jun 27, 2022Updated 4 years ago
amasad / arabish
View on GitHub
Arabic Transliteration in Python
☆36Aug 19, 2013Updated 12 years ago
masakhane-io / masakhane-community
View on GitHub
All our community docs! Start here! Lets put Africa on the NLP Map
☆68Apr 16, 2024Updated 2 years ago
csong27 / auditing-text-generation
View on GitHub
Code for Auditing Data Provenance in Text-Generation Models (in KDD 2019)
☆10Jun 18, 2019Updated 7 years ago
kbatsuren / wiktra
View on GitHub
Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)
☆37Jun 29, 2025Updated last year
unicode-org / unilex
View on GitHub
Lexical data at Unicode
☆70Sep 1, 2024Updated last year
GILT-Forum / TM-Mgmt-Best-Practices
View on GitHub
Best Practices in Translation Memory Management
☆47Dec 14, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xiaody / adblock-minus
View on GitHub
Implements core functionality of adblocking
☆16Feb 23, 2018Updated 8 years ago
Niger-Volta-LTI / yoruba-voice
View on GitHub
Repo & Project for the Imminent Research Grant code & tasks
☆12May 20, 2024Updated 2 years ago
zeeguu / python-translators
View on GitHub
Wrappers for translation services
☆13May 26, 2025Updated last year
thammegowda / mtdata
View on GitHub
A tool that locates, downloads, and extracts machine translation corpora
☆167Apr 13, 2026Updated 3 months ago
haneul-yoo / HUE
View on GitHub
Hanja Understanding Evaluation Dataset
☆15May 2, 2022Updated 4 years ago
UBC-NLP / afrolid
View on GitHub
AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.
☆39Feb 5, 2026Updated 5 months ago
masakhane-io / masakhane-ner
View on GitHub
☆122Oct 15, 2025Updated 9 months ago