JoergFranke / phoneme_recognition
Phoneme Recognition using RecNet
☆97Updated 8 years ago
Alternatives and similar repositories for phoneme_recognition:
Users who are interested in phoneme_recognition compare it to the libraries listed below.
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Updated 5 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆45Updated 4 years ago
- Fast spectrogram phase recovery using Local Weighted Sums (C/Python/Matlab)☆113Updated last year
- An open-source speech separation and enhancement library☆211Updated 4 years ago
- Implementation of audio degradation processes☆101Updated 9 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 5 years ago
- PyTorch-based phoneme recognition (TIMIT phoneme classification)☆34Updated 6 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Updated 4 years ago
- ASR with PyTorch☆140Updated 6 years ago
- An LDA/PLDA estimator using Kaldi in Python for speaker verification tasks☆99Updated 7 years ago
- Implementation of the Griffin and Lim algorithm to recover an audio signal from a magnitude-only spectrogram.☆173Updated 6 years ago
- DNN-for-speech-enhancement☆174Updated 2 years ago
- Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure☆89Updated 7 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆31Updated 5 years ago
- Tacotron2 + LPCNET for complete End-to-End TTS System☆93Updated last year
- Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive)☆81Updated 4 years ago
- Target speaker extraction and verification for multi-talker speech☆175Updated 4 years ago
- Speech Denoising with Deep Feature Losses☆186Updated 4 years ago
- Some useful speech processing features, such as MFCC, gammatone filterbank, GFCC, spectrum (power spectrum and log-power spectrum), Amplit…☆126Updated 4 years ago
- Deep neural network based speech enhancement toolkit☆213Updated 5 years ago
- An unofficial PyTorch implementation of Google's VoiceFilter☆100Updated last year
- Cross-lingual Voice Conversion☆97Updated 7 years ago
- Extract x-vectors and i-vectors under Kaldi☆109Updated 6 years ago
- A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https:/…☆209Updated last year
- An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" …☆114Updated 4 years ago