ifding / wavenet-speech-to-text
A PyTorch implementation of speech recognition based on DeepMind's WaveNet
☆18 Updated 6 years ago
Alternatives and similar repositories for wavenet-speech-to-text:
Users who are interested in wavenet-speech-to-text are comparing it to the repositories listed below.
- Demos, pretrained models, and (WIP) code supporting Representation Mixing ☆51 Updated 6 years ago
- ☆27 Updated 6 years ago
- Anonymous ICLR Submission ☆14 Updated 5 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf ☆31 Updated 3 years ago
- Network specification and demo ☆35 Updated 7 years ago
- FFTNet vocoder implementation ☆81 Updated 6 years ago
- PyTorch implementation of FFTNet ☆86 Updated 6 years ago
- ☆15 Updated 7 years ago
- 2018/2019 TTS framework integrating state of the art open source methods ☆47 Updated 5 years ago
- Generate vector embeddings for music ☆18 Updated 7 years ago
- ☆31 Updated 6 years ago
- ☆24 Updated 6 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Waven… ☆52 Updated 6 years ago
- A PyTorch implementation of FFTNet. ☆37 Updated 6 years ago
- Losses and decoders for end-to-end ASR and OCR ☆34 Updated 4 years ago
- PyTorch bindings for Warp-CTC ☆42 Updated 5 years ago
- Example implementation of Monotonic Chunkwise Attention. ☆52 Updated 7 years ago
- Quasi-Recurrent Neural Network (QRNN) for TensorFlow ☆23 Updated 6 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201… ☆15 Updated 3 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system ☆36 Updated 5 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder ☆64 Updated 6 years ago
- Sound augmentation using a large-scale audio dataset (AudioSet) ☆44 Updated 3 years ago
- ☆21 Updated 7 years ago
- Universal Python binding for the LMDB 'Lightning' Database ☆13 Updated 7 years ago
- PyTorch implementation of sparse_image_warp, with an example of Google Brain's SpecAugment: A Simple Data Augmentation Method for A… ☆23 Updated 5 years ago
- A Chainer implementation of Fast WaveNet (mel-spectrogram vocoder). ☆31 Updated 6 years ago
- PyTorch Tacotron2 https://arxiv.org/pdf/1712.05884.pdf ☆43 Updated 7 years ago
- ESPnet-TTS Audio Sample HP ☆21 Updated 5 years ago
- ASR project with pytorch-lightning ☆20 Updated last month
- Speech-to-text in PyTorch ☆80 Updated 6 years ago