noahchalifour / baidu-deepspeech2
A Tensorflow implementation of Baidu's Deep Speech 2 paper
☆18Updated 6 years ago
Alternatives and similar repositories for baidu-deepspeech2
Users that are interested in baidu-deepspeech2 are comparing it to the libraries listed below
Sorting:
- Tacotron text to speech in C++(synthesize only)☆76Updated 5 years ago
- ☆20Updated 5 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆46Updated 4 years ago
- TTS model based on Transformer.☆58Updated 5 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- Custom decoders for Kaldi☆79Updated 5 years ago
- ☆48Updated 4 years ago
- Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and Tensorflow)☆39Updated 7 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Updated 8 years ago
- tensorflow speech synthesis c++ inference for voicenet☆16Updated 6 years ago
- An implementation of Tacotron and Tacotron2☆81Updated 3 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Updated 5 years ago
- Long audio alignment using Kaldi☆23Updated 4 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆34Updated 6 years ago
- Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive)☆81Updated 4 years ago
- Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).☆62Updated 9 years ago
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 4 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 6 years ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Updated 5 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- c++ Kaldi IO lib (static and dynamic).☆25Updated 6 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- an tutorial implement of voice conversion using pytorch☆35Updated 7 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆50Updated 7 years ago
- create CMakeLists.txt for kaldi☆20Updated 5 years ago
- The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.☆33Updated 6 years ago
- An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" …☆114Updated 4 years ago
- Hybrid speech synthesiser☆28Updated 6 years ago