noahchalifour / baidu-deepspeech2
A Tensorflow implementation of Baidu's Deep Speech 2 paper
☆18Updated 6 years ago
Alternatives and similar repositories for baidu-deepspeech2:
Users that are interested in baidu-deepspeech2 are comparing it to the libraries listed below
- Bidirectional dynamic RNN + CTC for phoneme recognition☆45Updated 4 years ago
- tensorflow speech synthesis c++ inference for voicenet☆16Updated 6 years ago
- ☆20Updated 5 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Updated 6 years ago
- Tacotron text to speech in C++(synthesize only)☆76Updated 5 years ago
- ☆48Updated 4 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Updated 8 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- ☆41Updated 6 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).☆62Updated 9 years ago
- c++ Kaldi IO lib (static and dynamic).☆25Updated 6 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Updated 7 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- an tutorial implement of voice conversion using pytorch☆35Updated 7 years ago
- 以音素建模构建NN-CTC声学模型☆15Updated 5 years ago
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- My solution to course E6870 (Speech Recognition) of Columbia University.☆37Updated 6 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Updated 5 years ago
- Keyword Spotting for detecting a word in an audio file☆17Updated 5 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆50Updated 7 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆65Updated 6 years ago
- ☆13Updated 4 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- Keyword Search Recipe for Subword ASR☆30Updated 5 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- 💬 A list of End-to-End speech recognition, including papers, codes and other materials☆51Updated 6 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 3 years ago