Toluwase / Word-Level-Language-Identification-for-Resource-Scarce-
English, Hausa, Igbo and Yoruba corpora and results (presented in excel files) of word-level language identification research using the character trigram of the featured languages
☆15Updated 6 years ago
Alternatives and similar repositories for Word-Level-Language-Identification-for-Resource-Scarce-:
Users that are interested in Word-Level-Language-Identification-for-Resource-Scarce- are comparing it to the libraries listed below
- Yorùbá language training text for NLP, ASR and TTS tasks☆76Updated 2 years ago
- Unsupervised Neural Machine Translation from West African Pidgin (Creole) to English without a single parallel sentence☆78Updated 4 years ago
- Automatic Diacritic Restoration of Yorùbá language Text☆24Updated 8 months ago
- This repo contains 3 hours of audio speech recordings in Yoruba language collected for research purposes.☆16Updated 4 years ago
- A curated list of research papers and resources on code-switching☆310Updated 3 months ago
- ☆49Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆103Updated 11 months ago
- Ìrànlọ́wọ́ is a utility library for analysis & (pre)processing of Yorùbá text → https://pypi.org/project/iranlowo☆19Updated 2 years ago
- All our community docs! Start here! Lets put Africa on the NLP Map☆59Updated 11 months ago
- This is the repository for my version of Kaldi for Dummies example.☆17Updated 6 years ago
- Machine Translation for Africa☆288Updated 2 years ago
- CMU Wilderness Multilingual Speech Dataset☆278Updated 5 years ago
- State of the Art Language models and Classifier for Bengali, which is primarily spoken by the Bengalis in South Asia.☆32Updated 4 years ago
- Hindi POS Tags and keywords using TNT model. Created Date: 28 Sept 2018☆25Updated 5 years ago
- Python library for converting numbers to words for all Indian Languages.☆35Updated 3 months ago
- ☆42Updated 3 years ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆224Updated 4 years ago
- Arabic Dialect Identification on AOC data.☆24Updated 6 years ago
- ☆43Updated 2 years ago
- Curated list of publicly available parallel corpus for Indian Languages☆32Updated 3 years ago
- The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project descript…☆28Updated 5 years ago
- Server framework for Kaldi ASR Toolkit☆97Updated last year
- ☆110Updated last year
- Datasets and tools for basic natural language processing.☆380Updated 3 years ago
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney☆162Updated 9 months ago
- 🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).☆380Updated 2 years ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆125Updated last year
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆154Updated 5 years ago
- A Python based API to access Indian language WordNets.☆39Updated 2 years ago
- ☆49Updated 6 years ago