masakhane-io / masakhane-communityLinks
All our community docs! Start here! Lets put Africa on the NLP Map
☆62Updated last year
Alternatives and similar repositories for masakhane-community
Users that are interested in masakhane-community are comparing it to the libraries listed below
Sorting:
- Machine Translation for Africa☆298Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆112Updated last year
- ☆115Updated last month
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase…☆201Updated 5 years ago
- Hausa-NMT: Empirical Study of Neural Machine translation for English-Hausa-English☆16Updated 5 years ago
- Yorùbá language training text for NLP, ASR and TTS tasks☆81Updated 2 years ago
- Description Describes the IndicNLP corpus and associated datasets☆188Updated 2 years ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆133Updated last year
- Curated list of publicly available parallel corpus for Indian Languages☆36Updated 4 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆40Updated 3 years ago
- Agile reading group that works☆13Updated 3 years ago
- Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.c…☆289Updated 2 years ago
- Awesome List of Tamil NLP & AI Resources☆114Updated 2 years ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆49Updated last year
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.☆72Updated last year
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆51Updated 3 years ago
- ☆17Updated 2 years ago
- XAI Tutorial for the Explainable AI track in the ALPS winter school 2021☆58Updated 4 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆34Updated 8 months ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆77Updated 3 years ago
- RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x flax community week☆28Updated 4 years ago
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch☆171Updated 2 years ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆34Updated last month
- Unsupervised Neural Machine Translation from West African Pidgin (Creole) to English without a single parallel sentence☆80Updated 4 years ago
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated 2 years ago
- this is where we share notebooks/projects used in your youtube channel☆149Updated 4 years ago
- Catalog of abusive language data (PLoS 2020)☆319Updated last year
- A benchmark for code-switched NLP, ACL 2020☆75Updated last year
- ☆43Updated 3 years ago
- Some notebooks for NLP☆207Updated 2 years ago