Niger-Volta-LTI / yoruba-adr
Automatic Diacritic Restoration of Yorùbá language Text
☆24 Updated 3 months ago
Related projects
Alternatives and complementary repositories for yoruba-adr
- Yorùbá language training text for NLP, ASR and TTS tasks ☆73 Updated last year
- Ìrànlọ́wọ́ is a utility library for analysis & (pre)processing of Yorùbá text → https://pypi.org/project/iranlowo ☆17 Updated last year
- Unsupervised Neural Machine Translation from West African Pidgin (Creole) to English without a single parallel sentence ☆75 Updated 3 years ago
- All our community docs! Start here! Let's put Africa on the NLP Map ☆54 Updated 7 months ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages ☆66 Updated 2 years ago
- 📖 A curated list of resources dedicated to Natural Language Processing (NLP) in the Yoruba Language. ☆22 Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆92 Updated 6 months ago
- Hausa-NMT: Empirical Study of Neural Machine Translation for English-Hausa-English ☆14 Updated 4 years ago
- ☆40 Updated 2 years ago
- ☆12 Updated 2 years ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages: https://afrisenti-semeval.github.io/ ☆46 Updated 10 months ago
- A multilingual lexicon of words to hurt. ☆80 Updated 2 weeks ago
- Tool to fix bitexts and tag near-duplicates for removal ☆29 Updated 3 months ago
- Arabic Dialect Identification on AOC data. ☆23 Updated 5 years ago
- Machine Translation for Africa ☆278 Updated 2 years ago
- These are lists for a variety of languages containing words that are distinctive to each language. ☆34 Updated 2 years ago
- Almost state-of-the-art text generation library ☆66 Updated 3 weeks ago
- The University of Pittsburgh English Language Institute Corpus (PELIC) dataset ☆22 Updated last year
- ☆105 Updated 11 months ago
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase… ☆190 Updated 4 years ago
- Creating super-parallel corpora of 1500+ unique languages for NLP research ☆32 Updated last year
- WNUT-2020 Task 2: Identification of informative COVID-19 English Tweets ☆30 Updated 4 months ago
- Efficient Low-Memory Aligner ☆139 Updated 2 months ago
- A Simple Flask App to interact with your Machine Translation Model ☆11 Updated 4 years ago
- XAI Tutorial for the Explainable AI track in the ALPS winter school 2021 ☆58 Updated 3 years ago
- A simple neural truecaser written in PyTorch and AllenNLP. ☆32 Updated 5 months ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/… ☆32 Updated 5 months ago
- XED multilingual emotion datasets ☆56 Updated last year
- A Python port of SimpleNLG ☆25 Updated last year
- A tool that locates, downloads, and extracts machine translation corpora ☆147 Updated 5 months ago