noahchalifour / baidu-deepspeech2
A TensorFlow implementation of Baidu's Deep Speech 2 paper
☆18Updated 6 years ago
Alternatives and similar repositories for baidu-deepspeech2
Users who are interested in baidu-deepspeech2 are comparing it to the libraries listed below.
- Tacotron text to speech in C++ (synthesis only)☆76Updated 5 years ago
- End to End Dialect Identification using Convolutional Neural Network☆52Updated 5 years ago
- TensorFlow speech synthesis C++ inference for voicenet☆16Updated 6 years ago
- Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).☆62Updated 9 years ago
- LogMMSE speech enhancement/noise reduction☆88Updated 5 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆46Updated 5 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago
- Deep Neural Network for Speaker Count Estimation☆153Updated 4 years ago
- ☆48Updated 4 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Updated 5 years ago
- Transformer for ASR system (via TensorFlow 2.0)☆114Updated 6 years ago
- TTS model based on Transformer.☆58Updated 5 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated 2 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- Custom decoders for Kaldi☆79Updated 6 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- Speech Enhancement using Bayesian WaveNet☆96Updated 7 years ago
- This repository includes four different implementations of the speaker verification task, including the GMM_UBM, Ivector, Deep-Speaker, an…☆32Updated 7 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- Speech Commands Recognition using end-to-end deep learning models in PyTorch☆27Updated 4 years ago
- Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive)☆81Updated 5 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 4 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆139Updated 4 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi☆66Updated 6 years ago
- Implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Updated 5 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆50Updated 7 years ago
- ☆35Updated 6 years ago
- An implementation of Power Normalized Cepstral Coefficients (PNCC)☆53Updated 5 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago