JoergFranke / phoneme_recognition
Phoneme Recognition using RecNet
☆98 · Updated 8 years ago
Alternatives and similar repositories for phoneme_recognition:
Users who are interested in phoneme_recognition are comparing it to the repositories listed below:
- Bidirectional dynamic RNN + CTC for phoneme recognition ☆45 · Updated 4 years ago
- Implementation of audio degradation processes ☆102 · Updated 9 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications. ☆80 · Updated 5 years ago
- TensorFlow implementation of x-vector topology on top of Kaldi recipe ☆119 · Updated 5 years ago
- PyTorch-based phoneme recognition (TIMIT phoneme classification) ☆34 · Updated 7 years ago
- Extract x-vectors and i-vectors under Kaldi ☆109 · Updated 6 years ago
- Implementation of state-of-the-art d-vector approach for speaker verification ☆127 · Updated 7 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN) ☆59 · Updated 5 years ago
- ☆130 · Updated 6 years ago
- Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive) ☆81 · Updated 4 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition… ☆98 · Updated 3 years ago
- Pulse Model vocoder ☆42 · Updated 6 years ago
- Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure ☆89 · Updated 7 years ago
- VoxCeleb1 i-vector based speaker recognition system ☆43 · Updated 6 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver… ☆90 · Updated 4 years ago
- Parallel WaveNet based on NSynth ☆107 · Updated 6 years ago
- ☆58 · Updated 5 years ago
- Voice Activity Detector ☆73 · Updated 2 years ago
- ☆60 · Updated 4 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" ☆102 · Updated 6 years ago
- Speaker embedding (verification and recognition) using TensorFlow with Kaldi ☆41 · Updated 7 years ago
- A statistical model-based Voice Activity Detection ☆192 · Updated 6 years ago
- A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network" ☆145 · Updated 5 years ago
- Discriminative Neural Clustering for Speaker Diarisation ☆78 · Updated 3 years ago
- Tacotron 2 implementation ☆87 · Updated 7 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi ☆65 · Updated 6 years ago
- Tacotron2 + LPCNet for complete End-to-End TTS System ☆93 · Updated last year
- Paper: https://arxiv.org/abs/1702.02285 ☆64 · Updated 6 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020) ☆81 · Updated 3 years ago
- An LDA/PLDA estimator using Kaldi in Python for speaker verification tasks ☆100 · Updated 8 years ago