Toluwase / Word-Level-Language-Identification-for-Resource-Scarce-
English, Hausa, Igbo and Yoruba corpora and results (presented in excel files) of word-level language identification research using the character trigram of the featured languages
☆14Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for Word-Level-Language-Identification-for-Resource-Scarce-
- A Simple Flask App to interact with your Machine Translation Model☆11Updated 4 years ago
- ☆48Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆91Updated 6 months ago
- An intent classifier which can classifies a query into one of the 21 given intents.☆74Updated 5 years ago
- Machine Translation for Africa☆277Updated 2 years ago
- Unsupervised Neural Machine Translation from West African Pidgin (Creole) to English without a single parallel sentence☆75Updated 3 years ago
- Agile reading group that works☆13Updated 2 years ago
- ☆42Updated 6 years ago
- Aspect based sentiment analysis for Hindi☆11Updated 7 years ago
- ☆40Updated 2 years ago
- Extractive Text Summarization in Python☆20Updated 6 years ago
- ☆12Updated 5 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆20Updated 3 years ago
- Part-of-Speech Tagging Models in Python☆15Updated 5 years ago
- Arabic Dialect Identification on AOC data.☆23Updated 5 years ago
- This is the repository for my version of Kaldi for Dummies example.☆17Updated 5 years ago
- FAQ's answering chatbot using open source chatbot framework Rasa Stack☆34Updated 6 years ago
- The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project descript…☆28Updated 4 years ago
- Ìrànlọ́wọ́ is a utility library for analysis & (pre)processing of Yorùbá text → https://pypi.org/project/iranlowo☆17Updated last year
- Spoken Language assessment☆41Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆112Updated 5 years ago
- This is a package in Python which implements a tokenizer, stemmer for Hindi language☆90Updated 4 years ago
- Chatbot based on Rasa Framework☆37Updated 5 years ago
- ☆68Updated last year
- ☆48Updated 5 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆85Updated 2 years ago
- Deep Learning neural network for correcting spelling☆54Updated last year
- A module for normalising text.☆173Updated 3 years ago
- Unsupervised Speaker Clustering & Speaker Recognition☆12Updated 5 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆13Updated 4 years ago