Toluwase / Word-Level-Language-Identification-for-Resource-Scarce-
English, Hausa, Igbo and Yoruba corpora, plus results (presented in Excel files), from word-level language identification research based on the character trigrams of the featured languages
☆15 Updated 6 years ago
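As a rough illustration of what word-level language identification with character trigrams can look like, here is a minimal sketch. It is not taken from the repository: the function names (`char_trigrams`, `train`, `identify`), the add-one smoothing, and the tiny unaccented word lists are all assumptions made for the example, and a real system would be trained on the full corpora.

```python
import math
from collections import Counter


def char_trigrams(word):
    """Return the character trigrams of a word, padded with boundary markers."""
    padded = f"^{word.lower()}$"
    return [padded[i:i + 3] for i in range(len(padded) - 2)]


def train(words_by_lang):
    """Build one trigram-count profile per language from lists of words."""
    return {
        lang: Counter(t for w in words for t in char_trigrams(w))
        for lang, words in words_by_lang.items()
    }


def identify(word, profiles):
    """Pick the language whose profile gives the word the highest
    add-one-smoothed log-likelihood (an assumed scoring choice)."""
    vocab_size = len({t for p in profiles.values() for t in p})

    def score(profile):
        total = sum(profile.values())
        return sum(
            math.log((profile[t] + 1) / (total + vocab_size))
            for t in char_trigrams(word)
        )

    return max(profiles, key=lambda lang: score(profiles[lang]))


# Hypothetical toy data, far too small for real use.
toy_corpus = {
    "english": ["water", "house", "people", "the", "market"],
    "yoruba": ["omi", "ile", "eniyan", "oja", "owo"],
}
profiles = train(toy_corpus)
print(identify("water", profiles))
print(identify("omi", profiles))
```

Padding each word with `^` and `$` lets the profiles capture typical word beginnings and endings, which tends to matter when the only context available is a single word.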
Alternatives and similar repositories for Word-Level-Language-Identification-for-Resource-Scarce-
Users who are interested in Word-Level-Language-Identification-for-Resource-Scarce- are comparing it to the repositories listed below
- Yorùbá language training text for NLP, ASR and TTS tasks ☆76 Updated 2 years ago
- A Simple Flask App to interact with your Machine Translation Model ☆12 Updated 5 years ago
- Automatic Diacritic Restoration of Yorùbá language Text ☆24 Updated 10 months ago
- ☆14 Updated 2 years ago
- ☆43 Updated 7 years ago
- Automatic Dialect Detection Repository ☆39 Updated 2 years ago
- ☆51 Updated 3 years ago
- This is a repository for the IGBONLP Project. ☆12 Updated 3 years ago
- All our community docs! Start here! Lets put Africa on the NLP Map ☆60 Updated last year
- This repository contains the Arabic sarcasm dataset (ArSarcasm) ☆24 Updated 4 years ago
- Arabic support for textblob ☆85 Updated 3 years ago
- Scripts to create speech corpora from open.bible ☆13 Updated 3 years ago
- 📖 A curated list of resources dedicated to Natural Language Processing (NLP) in the Yoruba Language. ☆22 Updated 4 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis… ☆14 Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆105 Updated last year
- Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP. ☆47 Updated 11 months ago
- ☆42 Updated 3 years ago
- A guide to building language technology in new languages. ☆58 Updated 3 years ago
- Agile reading group that works ☆13 Updated 3 years ago
- This repository ☆30 Updated 2 years ago
- Support tools for punctuation and boundary detection for ASR output. ☆57 Updated 2 years ago
- Explore the content of Arabic text datasets. ☆18 Updated 3 years ago
- A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be… ☆26 Updated 11 years ago
- The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project descript… ☆28 Updated 5 years ago
- ☆25 Updated 5 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2 ☆114 Updated 6 years ago
- Curate online wolof text resources that can be used to build models ☆23 Updated last month
- Arabic edition of BERT pretrained language models ☆129 Updated 4 years ago
- Machine Translation for Africa ☆289 Updated 2 years ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications ☆13 Updated 3 years ago