SEERNET / DeepSpeaker
☆11Updated this week
Related projects: ⓘ
- FFTNet vocoder implementation☆81Updated 5 years ago
- ☆71Updated 7 years ago
- Tensorflow with KenLM integrated for beam search scoring☆34Updated 7 years ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 6 years ago
- NIST Language i-vector Machine Learning Challenge☆27Updated 8 years ago
- TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS☆16Updated 6 years ago
- pytorch implementation of lyre.ai's char2wav model☆32Updated 7 years ago
- Network specification and demo☆35Updated 7 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 7 years ago
- ☆19Updated 9 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB☆50Updated 5 years ago
- tools around preparing TIMIT for HMM (with HTK) and deep learning (with Theano) methods☆78Updated 9 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Updated 4 years ago
- ☆21Updated this week
- ☆66Updated 8 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆108Updated 5 years ago
- The official repository of the Eesen project☆12Updated 6 years ago
- torch7 module to convert one person's voice to another's.☆16Updated 8 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- Neural network phone duration model on top of the Kaldi speech recognition framework☆25Updated 8 years ago
- Code for experiments with our RNN regularizer, which stochastically forces units to maintain previous values.☆79Updated 6 years ago
- An implementation of zoneout regularizer on LSTM-RNN by Tensorflow☆25Updated 7 years ago
- Identifying the language of input text using character-level n-grams, with support for 45 languages☆11Updated last year
- Code for "Online and Linear Time Attention by Enforcing Monotonic Alignments"☆92Updated 6 years ago
- Portal of Johannes and Felix's RNN implementation and further modifications for ASR☆21Updated 9 years ago
- ☆12Updated this week
- Deep Learning for Speech Recogntion based on Theano☆15Updated 7 years ago
- A PyTorch implementation of fast-wavenet☆92Updated 6 years ago
- Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)☆52Updated 5 years ago
- Representations of language in a model of visually grounded speech signal.☆23Updated 6 years ago