masakhane-io / masakhane-mt
Machine Translation for Africa
☆288Updated 2 years ago
Alternatives and similar repositories for masakhane-mt:
Users that are interested in masakhane-mt are comparing it to the libraries listed below
- All our community docs! Start here! Lets put Africa on the NLP Map☆59Updated 11 months ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆103Updated 11 months ago
- ☆109Updated last year
- ☆12Updated 3 years ago
- Yorùbá language training text for NLP, ASR and TTS tasks☆76Updated 2 years ago
- ☆42Updated 3 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆72Updated 2 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆39Updated 2 years ago
- Hausa-NMT: Empirical Study of Neural Machine translation for English-Hausa-English☆15Updated 4 years ago
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase…☆193Updated 4 years ago
- Crosslingual Question Answering for African Languages☆29Updated 6 months ago
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆15Updated 4 years ago
- MAFAND-MT☆55Updated 8 months ago
- ☆17Updated 2 years ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆48Updated last year
- MasakhaNEWS: News Topic Classification for African Languages☆23Updated 10 months ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆35Updated 3 months ago
- A Collection of Research Papers by Data Science Nigeria☆26Updated last year
- Automatic Diacritic Restoration of Yorùbá language Text☆24Updated 8 months ago
- Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.c…☆281Updated last year
- Arabic Dialect Identification on AOC data.☆24Updated 6 years ago
- Agile reading group that works☆13Updated 3 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆31Updated 3 weeks ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Stanford's Alexa Prize socialbot☆133Updated last year
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.☆73Updated 7 months ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆33Updated 2 years ago
- XAI Tutorial for the Explainable AI track in the ALPS winter school 2021☆58Updated 4 years ago
- A benchmark for code-switched NLP, ACL 2020☆74Updated 10 months ago
- A Simple Flask App to interact with your Machine Translation Model☆12Updated 5 years ago