SEERNET / DeepSpeaker

☆11

Related projects:

erogol / FFTNet
FFTNet vocoder implementation
☆81 Updated 5 years ago
jcsilva / deep-clustering
☆71 Updated 7 years ago
louiskirsch / tensorflow-with-kenlm
TensorFlow with KenLM integrated for beam search scoring
☆34 Updated 7 years ago
artem179 / WLAS
The implementation of the 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p…
☆11 Updated 6 years ago
udibr / LRE
NIST Language i-vector Machine Learning Challenge
☆27 Updated 8 years ago
MU94W / Tacotron
TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS
☆16 Updated 6 years ago
geyang / char2wav_pytorch
PyTorch implementation of lyre.ai's char2wav model
☆32 Updated 7 years ago
JeremyCCHsu / vc-vawgan
Network specification and demo
☆35 Updated 7 years ago
transfer-learning-asr / transfer-learning-asr
Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017
☆46 Updated 7 years ago
zxie / nn
☆19 Updated 9 years ago
Yolanda-Gao / VoiceGAN
Results for VoiceGAN voice transformation. The audio samples are in the folders A-AB-ABA/B-BA-BAB
☆50 Updated 5 years ago
syhw / timit_tools
Tools for preparing TIMIT for HMM (with HTK) and deep learning (with Theano) methods
☆78 Updated 9 years ago
hlt-mt / TranscRater
An open-source tool for automatic speech recognition (ASR) quality estimation.
☆23 Updated 4 years ago
yiwangbaidu / notes
☆21 Updated this week
usernaamee / audio-GAN
☆66 Updated 8 years ago
vrenkens / nabu
Code for end-to-end ASR with neural networks, built with TensorFlow
☆108 Updated 5 years ago
jb1999 / eesen
The official repository of the Eesen project
☆12 Updated 6 years ago
galv / voice-conversion
Torch7 module to convert one person's voice to another's.
☆16 Updated 8 years ago
shamidreza / dnnmapper
Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…
☆33 Updated 6 years ago
alumae / kaldi-nnet-dur-model
Neural network phone duration model on top of the Kaldi speech recognition framework
☆25 Updated 8 years ago
teganmaharaj / zoneout
Code for experiments with our RNN regularizer, which stochastically forces units to maintain previous values.
☆79 Updated 6 years ago
tam17aki / zoneout-tensorflow
An implementation of the zoneout regularizer for LSTM-RNNs in TensorFlow
☆25 Updated 7 years ago
viswavi / languageid
Identifying the language of input text using character-level n-grams, with support for 45 languages
☆11 Updated last year
craffel / mad
Code for "Online and Linear Time Attention by Enforcing Monotonic Alignments"
☆92 Updated 6 years ago
naxingyu / lstm-rnn
Portal of Johannes and Felix's RNN implementation and further modifications for ASR
☆21 Updated 9 years ago
iammrhelo / speech2vec
☆12 Updated this week
ZhangAustin / Deep-Speech
Deep Learning for Speech Recognition based on Theano
☆15 Updated 7 years ago
dhpollack / fast-wavenet.pytorch
A PyTorch implementation of fast-wavenet
☆92 Updated 6 years ago
akashmjn / cs224n-gpu-that-talks
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
☆52 Updated 5 years ago
gchrupala / visually-grounded-speech
Representations of language in a model of visually grounded speech signal.
☆23 Updated 6 years ago