ifding / wavenet-speech-to-text
A PyTorch implementation of speech recognition based on DeepMind's WaveNet
☆18Updated 6 years ago
Alternatives and similar repositories for wavenet-speech-to-text:
Users that are interested in wavenet-speech-to-text are comparing it to the libraries listed below
- Anonymous ICLR Submission☆14Updated 5 years ago
- ☆27Updated 5 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 4 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 2 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Updated 6 years ago
- ESPnet-TTS Audio Sample HP☆21Updated 5 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 5 years ago
- Losses and decoders for end-to-end ASR and OCR☆33Updated 4 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆31Updated 5 years ago
- Converts TensorFlow checkpoints (with index, meta and data files) to PyTorch, HDF5 and JSON☆18Updated 3 years ago
- Example implementation of Monotonic Chunkwise Attention.☆51Updated 6 years ago
- [Deprecated] PyTorch Lite is a lightweight machine learning framework for on-device mobile inference.☆15Updated 5 years ago
- Network specification and demo☆35Updated 7 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- PyTorch CTC Decoder bindings☆14Updated 7 years ago
- Code for our paper: "Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers".☆21Updated 3 years ago
- A simplistic web app for annotating emotions in human speech video recordings.☆27Updated 10 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Updated 5 years ago
- Examples of cleaning up raw voices☆18Updated 2 years ago
- PyTorch bindings for Warp-CTC☆42Updated 5 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 5 years ago
- A fast cnn-based vocoder☆78Updated 4 years ago
- ASR project with pytorch-lightning☆20Updated 4 years ago
- Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A…☆23Updated 5 years ago
- A monster repo for random research, not organized in any particular way☆13Updated 8 years ago
- ☆58Updated 3 years ago
- FFTNet vocoder implementation☆81Updated 6 years ago