Toluwase / Word-Level-Language-Identification-for-Resource-Scarce-
English, Hausa, Igbo and Yoruba corpora and results (presented in excel files) of word-level language identification research using the character trigram of the featured languages
☆15Updated 6 years ago
Alternatives and similar repositories for Word-Level-Language-Identification-for-Resource-Scarce-:
Users that are interested in Word-Level-Language-Identification-for-Resource-Scarce- are comparing it to the libraries listed below
- Yorùbá language training text for NLP, ASR and TTS tasks☆74Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆102Updated 10 months ago
- Unsupervised Neural Machine Translation from West African Pidgin (Creole) to English without a single parallel sentence☆77Updated 4 years ago
- Automatic Diacritic Restoration of Yorùbá language Text☆24Updated 7 months ago
- All our community docs! Start here! Lets put Africa on the NLP Map☆57Updated 10 months ago
- Hindi NLP work☆14Updated 2 years ago
- Machine Translation for Africa☆282Updated 2 years ago
- Ìrànlọ́wọ́ is a utility library for analysis & (pre)processing of Yorùbá text → https://pypi.org/project/iranlowo☆18Updated 2 years ago
- An intent classifier which can classifies a query into one of the 21 given intents.☆74Updated 6 years ago
- ☆13Updated 4 years ago
- 📖 A curated list of resources dedicated to Natural Language Processing (NLP) in the Yoruba Language.☆22Updated 4 years ago
- ☆43Updated 9 years ago
- A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be…☆26Updated 10 years ago
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.☆72Updated 6 months ago
- A curated list of research papers and resources on code-switching☆307Updated 2 months ago
- Resources to go with the Indic NLP Library☆73Updated 2 years ago
- Xlit-Crowd: Hindi-English Transliteration Corpus☆37Updated 10 years ago
- Arabic speech recognition and dialect identification (Red Hen Lab - GSoC 2018)☆17Updated 4 years ago
- ☆49Updated 3 years ago
- A Python based API to access Indian language WordNets.☆39Updated 2 years ago
- Arabic Dialect Identification on AOC data.☆24Updated 6 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆40Updated 2 years ago
- ☆69Updated last year
- This repo contains 3 hours of audio speech recordings in Yoruba language collected for research purposes.☆16Updated 4 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆14Updated 2 years ago
- Python library for converting numbers to words for all Indian Languages.☆35Updated last month
- hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic☆46Updated 4 years ago
- A Simple Flask App to interact with your Machine Translation Model☆12Updated 5 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆35Updated 2 months ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications☆13Updated 3 years ago