smalekz / PCVC
Persian Consonant Vowel Combination (PCVC) Speech Dataset
☆19Updated 4 years ago
Alternatives and similar repositories for PCVC:
Users that are interested in PCVC are comparing it to the libraries listed below
- Grapheme To Phoneme☆71Updated 8 months ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- Persian Grapheme-to-Phoneme (G2P) converter☆40Updated 8 months ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆42Updated 2 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Updated 5 years ago
- Long audio alignment using Kaldi☆23Updated 3 years ago
- ☆29Updated 2 years ago
- Dynamic time warping (DTW) functions for specifically speech alignment.☆28Updated 11 months ago
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 6 months ago
- Implementation of audio degradation processes☆102Updated 9 years ago
- ☆40Updated 3 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆141Updated last year
- Sharif Emotional Speech Database☆34Updated 4 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆32Updated 4 years ago
- Python implementation of a few speech intelligibility prediction algorithms☆13Updated 10 months ago
- Python library for audio augmentation☆83Updated last year
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆143Updated last year
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Formant Tracking & Estimation☆75Updated 4 months ago
- Hybrid speech synthesiser☆28Updated 6 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Updated 6 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆45Updated 4 years ago
- easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox☆49Updated 5 years ago
- Crowdsourced Audio Quality Evaluation Toolkit☆52Updated 2 years ago
- NIST SPH File reader (e.g. for TEDLIUM Corpus)☆25Updated 4 years ago
- Feature extractor for DL speech processing.☆65Updated 3 years ago
- MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆17Updated 5 years ago
- A python implementation of Speech intelligibility in bits (SIIB)☆24Updated 3 years ago