unza-speech-lab / zambezi-voiceLinks
Repository for multilingual speech data resources for native languages of Zambia.
☆17Updated 9 months ago
Alternatives and similar repositories for zambezi-voice
Users that are interested in zambezi-voice are comparing it to the libraries listed below
Sorting:
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆42Updated 2 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Updated 4 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆37Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 9 months ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆44Updated 2 years ago
- phone inventory library☆16Updated 2 years ago
- ☆15Updated 4 years ago
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated 2 years ago
- ☆41Updated last month
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆32Updated last year
- An extension of PHOIBLE that includes features for allophones.☆10Updated 2 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆22Updated 10 months ago
- Utility functions for preprocessing PodcastFillers dataset☆9Updated 3 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- Simple Kaldi recipe for forced alignment☆10Updated 2 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated 2 months ago
- ☆16Updated 9 months ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 2 years ago
- ☆31Updated 2 years ago
- ☆40Updated 3 years ago
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- radiomixer☆14Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 9 months ago
- Whisper Speech Quality Assessment (WhiSQA)☆10Updated 7 months ago
- This is the project page of our paper "MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion".☆11Updated 4 months ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 3 years ago
- ☆23Updated last year
- NVIDIA's FastPitch, extracted from the DeepLearningExamples repository☆13Updated last year