noahchalifour / baidu-deepspeech2Links

A Tensorflow implementation of Baidu's Deep Speech 2 paper

☆18

Alternatives and similar repositories for baidu-deepspeech2

Users that are interested in baidu-deepspeech2 are comparing it to the libraries listed below

Sorting:

syoyo / tacotron-tts-cpp
Tacotron text to speech in C++(synthesize only)
☆76Updated 5 years ago
swshon / dialectID_e2e
End to End Dialect Identification using Convolutional Neural Network
☆52Updated 5 years ago
npuichigo / ttsflow
tensorflow speech synthesis c++ inference for voicenet
☆16Updated 6 years ago
idiap / juicer
Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).
☆62Updated 9 years ago
rajivpoddar / logmmse
LogMMSE speech enhancement/noise reduction
☆88Updated 5 years ago
tbornt / phoneme_ctc
Bidirectional dynamic RNN + CTC for phoneme recognition
☆46Updated 5 years ago
Suhee05 / Text-Independent-Speaker-Verification
Text Independent Speaker Verification Using GE2E Loss
☆84Updated 6 years ago
faroit / CountNet
Deep Neural Network for Speaker Count Estimation
☆153Updated 4 years ago
YiwenShaoStephen / pychain_example
☆48Updated 4 years ago
wangkenpu / rsrgan
Robust Speech Recognition Using Generative Adversarial Networks (GAN)
☆59Updated 5 years ago
xingchensong / Speech-Transformer-tf2.0
transformer for ASR-systerm (via tensorflow2.0)
☆114Updated 6 years ago
xcmyz / Transformer-TTS
TTS model based on Transformer.
☆58Updated 5 years ago
Jamiroquai88 / VBDiarization
Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data
☆95Updated 2 years ago
moisesveleta / GOP-LSTM
Improving the Goodness of Pronunciation with DNNs and RNNs
☆32Updated 6 years ago
jpuigcerver / kaldi-decoders
Custom decoders for Kaldi
☆79Updated 6 years ago
jcsilva / multilingual-g2p
Multilingual Grapheme to Phoneme
☆50Updated 9 years ago
auspicious3000 / WaveNet-Enhancement
Speech Enhancement using Bayesian WaveNet
☆96Updated 7 years ago
prajual / Master-Voice_Prints
This Repository includes four different implementations of the Speaker Verification task including the GMM_UBM, Ivector, Deep-Speaker, an…
☆32Updated 7 years ago
dabinat / deepspeech-tools
Scripts to simplify data prepping for Mozilla DeepSpeech.
☆14Updated 5 years ago
jarfo / gcommands
Speech Commands Recognition using end-to-end deep learning models in pytorch
☆27Updated 4 years ago
ryokamoi / ppg_vc
Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive)
☆81Updated 5 years ago
JRMeyer / multi-task-kaldi
An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…
☆54Updated 5 years ago
wangyu09 / exkaldi-rt
An online speech recognition extension toolkit of Kaldi
☆56Updated 4 years ago
HawkAaron / RNN-Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
☆139Updated 4 years ago
ZitengWang / python_kaldi_features
python codes to extract MFCC and FBANK speech features for Kaldi
☆66Updated 6 years ago
danFromTelAviv / key_words_spotting
implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"
☆36Updated 5 years ago
weedwind / CTC-speech-recognition
This is a working example of using CTC for phone recognition on TIMIT
☆50Updated 7 years ago
liyongze / lstm_speaker_verification
☆35Updated 6 years ago
supikiti / PNCC
A implementation of Power Normalized Cepstral Coefficients: PNCC
☆53Updated 5 years ago
mravanelli / pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…
☆38Updated 7 years ago