Niger-Volta-LTI / yoruba-adr
Automatic Diacritic Restoration of Yorùbá language Text
☆24Updated 5 months ago
Alternatives and similar repositories for yoruba-adr:
Users that are interested in yoruba-adr are comparing it to the libraries listed below
- Yorùbá language training text for NLP, ASR and TTS tasks☆73Updated last year
- Unsupervised Neural Machine Translation from West African Pidgin (Creole) to English without a single parallel sentence☆75Updated 4 years ago
- Ìrànlọ́wọ́ is a utility library for analysis & (pre)processing of Yorùbá text → https://pypi.org/project/iranlowo☆17Updated 2 years ago
- All our community docs! Start here! Lets put Africa on the NLP Map☆54Updated 9 months ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆98Updated 8 months ago
- Machine Translation for Africa☆278Updated 2 years ago
- ☆106Updated last year
- Arabic Dialect Identification on AOC data.☆23Updated 5 years ago
- ☆11Updated 3 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆66Updated 2 years ago
- ☆14Updated 2 years ago
- ☆12Updated 2 years ago
- Code for extracting parallel corpora from pmindia☆16Updated 4 years ago
- MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert…☆48Updated 3 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆72Updated 9 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆41Updated last year
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆47Updated last year
- ☆12Updated 3 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆30Updated last year
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆39Updated 2 years ago
- Almost state of art text generation library☆66Updated 2 months ago
- 📖 A curated list of resources dedicated to Natural Language Processing (NLP) in the Yoruba Language.☆22Updated 3 years ago
- Automatic extraction of edited sentences from text edition histories.☆82Updated 2 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆29Updated 5 months ago
- ☆42Updated 3 years ago
- Crawler for linguistic corpora☆197Updated last year
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase…☆192Updated 4 years ago
- Runnable morphological analysis tools from the UniMorph project☆15Updated 6 years ago
- Easier Automatic Sentence Simplification Evaluation☆160Updated last year