noahchalifour / baidu-deepspeech2
A Tensorflow implementation of Baidu's Deep Speech 2 paper
☆18Updated 5 years ago
Alternatives and similar repositories for baidu-deepspeech2:
Users that are interested in baidu-deepspeech2 are comparing it to the libraries listed below
- tensorflow speech synthesis c++ inference for voicenet☆16Updated 5 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆45Updated 4 years ago
- Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive)☆81Updated 4 years ago
- Tacotron text to speech in C++(synthesize only)☆76Updated 5 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆51Updated 7 years ago
- TTS model based on Transformer.☆57Updated 5 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- ☆20Updated 5 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆58Updated 5 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆65Updated 6 years ago
- Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).☆61Updated 9 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆22Updated 4 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Updated 7 years ago
- An implementation of rnn transducer for sequence labeling problem☆22Updated 6 years ago
- Custom decoders for Kaldi☆79Updated 5 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- 💬 A list of End-to-End speech recognition, including papers, codes and other materials☆52Updated 5 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- ☆48Updated 4 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆55Updated 6 years ago
- A pitch tracker inspired by David Talkin's RAPT (Robust Algorithm for Pitch Tracking) written in Python.☆47Updated 8 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆31Updated 5 years ago
- Open Source WFST-based Decoder Toolkit☆76Updated 9 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Updated last year
- transformer for ASR-systerm (via tensorflow2.0)☆114Updated 5 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- ☆41Updated 6 years ago
- Estimate the number of concurrent speakers from single channel mixtures to crack the "cocktail-party” problem.☆22Updated 4 years ago