ifding / wavenet-speech-to-text
A PyTorch implementation of speech recognition based on DeepMind's WaveNet
☆18Updated 7 years ago
Alternatives and similar repositories for wavenet-speech-to-text
Users who are interested in wavenet-speech-to-text are comparing it to the libraries listed below.
- FFTNet vocoder implementation☆81Updated 6 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Updated 6 years ago
- Network specification and demo☆35Updated 8 years ago
- Source code for "Towards a Deeper Understanding of Adversarial Losses under a Discriminative Adversarial Network Setting"☆42Updated 2 years ago
- Examples of cleaning up raw voices☆18Updated 3 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf ☆31Updated 3 years ago
- ☆27Updated 6 years ago
- A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆93Updated 6 years ago
- PyTorch bindings for Warp-CTC☆43Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- PyTorch Implementation of FFTNet☆86Updated 7 years ago
- ☆31Updated 6 years ago
- A fast CNN-based vocoder☆78Updated 5 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 4 years ago
- A monster repo for random research, not organized in any particular way☆13Updated 8 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 5 years ago
- Representations of language in a model of visually grounded speech signal.☆23Updated 7 years ago
- A Chainer implementation of Fast WaveNet (mel-spectrogram vocoder).☆31Updated 6 years ago
- ☆12Updated 7 years ago
- Audio samples from ICML2019 "Almost Unsupervised Text to Speech and Automatic Speech Recognition"☆17Updated 6 years ago
- PyTorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- An implementation of Tacotron and Tacotron2☆81Updated 3 years ago
- ☆8Updated 7 years ago
- ☆15Updated 3 years ago
- A Chainer implementation of WaveGlow.☆41Updated 6 years ago
- ESPnet-TTS Audio Sample HP☆21Updated 5 years ago
- An extensible speech synthesis system built with PyTorch; the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery ☆26Updated 6 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 7 years ago
- The Additive Margin MobileNet1D is a new lightweight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year