ifding / wavenet-speech-to-text
A PyTorch implementation of speech recognition based on DeepMind's WaveNet
☆18 Updated 7 years ago
Alternatives and similar repositories for wavenet-speech-to-text
Users who are interested in wavenet-speech-to-text are comparing it to the libraries listed below:
- ☆27 Updated 6 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing ☆51 Updated 6 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf ☆31 Updated 3 years ago
- Python way to Read/Write TFRecords ☆64 Updated 7 years ago
- A PyTorch implementation of DeepSpeech and DeepSpeech2. ☆50 Updated 6 years ago
- A PyTorch implementation of fast-wavenet ☆93 Updated 7 years ago
- FFTNet vocoder implementation ☆81 Updated 7 years ago
- Universal Python binding for the LMDB 'Lightning' Database ☆13 Updated 7 years ago
- Stochastic Adaptive Neural Architecture Search ☆65 Updated 6 years ago
- Official Tensorflow implementation of the paper "Y-Autoencoders: disentangling latent representations via sequential-encoding", Pattern R… ☆52 Updated 5 years ago
- Pytorch implementation of time-domain filterbanks ☆112 Updated 4 years ago
- Anonymous ICLR Submission ☆14 Updated 6 years ago
- PyTorch CTC Decoder bindings ☆42 Updated 7 years ago
- ☆15 Updated 3 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch. ☆85 Updated 6 years ago
- Time Delayed NN implemented in pytorch ☆81 Updated 8 years ago
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese ☆18 Updated 5 years ago
- A very naive and simple benchmark between dlib and pytorch in terms of space and time ☆19 Updated 5 years ago
- Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice. ☆23 Updated 5 years ago
- PyTorch bindings for Warp-CTC ☆43 Updated 5 years ago
- Conversational AI Benchmark. ☆68 Updated 2 years ago
- The code for the MaD TwinNet. Demo page: ☆112 Updated 2 years ago
- Implementation of WaveNet with Gluon ☆16 Updated 6 years ago
- ☆70 Updated 8 years ago
- Network specification and demo ☆35 Updated 8 years ago
- The implementation of 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p… ☆11 Updated 7 years ago
- Deep CNN networks for Speech Synthesis ☆49 Updated 7 years ago
- Losses and decoders for end-to-end ASR and OCR ☆34 Updated 4 years ago
- Code for reproducing results in "Generative Model with Dynamic Linear Flow" ☆71 Updated 6 years ago
- Keras Implementation of "Look, Listen and Learn" Model ☆21 Updated 7 years ago