masakhane-io / masakhane-community
All our community docs! Start here! Let's put Africa on the NLP map
☆60 · Updated last year
Alternatives and similar repositories for masakhane-community
Users who are interested in masakhane-community compare it to the repositories listed below.
- Machine Translation for Africa ☆294 · Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆109 · Updated last year
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase… ☆199 · Updated 5 years ago
- Describes the IndicNLP corpus and associated datasets ☆181 · Updated 2 years ago
- Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.c… ☆287 · Updated 2 years ago
- RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x flax community week ☆28 · Updated 4 years ago
- A Collection of Research Papers by Data Science Nigeria ☆27 · Updated 3 months ago
- Hausa-NMT: Empirical Study of Neural Machine translation for English-Hausa-English ☆16 · Updated 4 years ago
- Yorùbá language training text for NLP, ASR and TTS tasks ☆80 · Updated 2 years ago
- Hindi NLP work ☆14 · Updated 3 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa. ☆40 · Updated 2 years ago
- 📄 A repo containing notes and discussions for our weekly NLP/ML paper discussions. ☆150 · Updated 5 years ago
- Agile reading group that works ☆13 · Updated 3 years ago
- XAI Tutorial for the Explainable AI track in the ALPS winter school 2021 ☆58 · Updated 4 years ago
- Curated list of publicly available parallel corpora for Indian Languages ☆34 · Updated 4 years ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H… ☆32 · Updated last year
- State of the Art Language models and Classifier for Bengali, which is primarily spoken by the Bengalis in South Asia. ☆32 · Updated 5 years ago
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch ☆170 · Updated 2 years ago
- State of the Art Language models and Classifier for Tamil language (spoken in India and a few other South Asian countries) ☆53 · Updated 5 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages ☆75 · Updated 3 years ago
- DRIFT is a tool for Diachronic Analysis of Scientific Literature. ☆115 · Updated 2 years ago
- Code Repository for the IndicXNLI paper. ☆15 · Updated 2 years ago
- The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020 ☆601 · Updated 5 years ago
- Crosslingual Question Answering for African Languages ☆31 · Updated 11 months ago
- Some notebooks for NLP ☆207 · Updated last year
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2 ☆130 · Updated last year