turinaf / Sagalee
Automatic Speech Recognition Dataset for Oromo Language
☆12 Updated 3 weeks ago
Alternatives and similar repositories for Sagalee:
Users who are interested in Sagalee are comparing it to the libraries listed below.
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover, Lexical analyzer, Corpus i… ☆35 Updated last year
- Repository for multilingual speech data resources for native languages of Zambia. ☆17 Updated 6 months ago
- ☆49 Updated 3 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages. ☆13 Updated last year
- This is an ASR corpus for the Bemba language. It contains read speech from diverse publicly available Bemba sources: Literature Books, Radio/… ☆35 Updated 3 months ago
- Linguistic data on the Yongning Na language ☆7 Updated last week
- Amharic/Tigrinya/Oromo Dictionaries ☆38 Updated last year
- HORNMORPHO is a Python program that analyzes Amharic, Oromo, and Tigrinya words into their constituent morphemes (meaningful parts) and g… ☆19 Updated 7 years ago
- This is a repository for the IGBONLP Project. ☆12 Updated 3 years ago
- Simple Kaldi recipe for forced alignment ☆10 Updated last year
- ☆42 Updated 3 years ago
- VoxAngeles Corpus ☆11 Updated last year
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data. ☆17 Updated 3 years ago
- Morphological processing for languages of the Horn of Africa ☆45 Updated 3 months ago
- Different semantic models for Amharic ☆19 Updated last year
- ☆42 Updated 7 years ago
- NLA-NU Kazakh Dependency Treebank ☆10 Updated 6 years ago
- A repository containing links to useful phonological software ☆11 Updated 2 years ago
- Scripts to create speech corpora from open.bible ☆13 Updated 3 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆11 Updated 4 years ago
- Proposed splits for the LREC Wikipron paper ☆14 Updated 5 years ago
- Pronounce Arabic words ☆19 Updated 5 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆17 Updated 2 years ago
- This repo contains 3 hours of audio speech recordings in the Yoruba language collected for research purposes. ☆16 Updated 4 years ago
- ☆10 Updated last year
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis… ☆14 Updated 3 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994) ☆12 Updated 4 years ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications ☆13 Updated 3 years ago
- Curate online Wolof text resources that can be used to build models ☆23 Updated last month
- An extension of PHOIBLE that includes features for allophones. ☆10 Updated last year