hoaaoh / Audio2Vec
Audio2Vec with multi lingual
☆8Updated 6 years ago
Related projects: ⓘ
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆126Updated 5 years ago
- Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)☆52Updated 5 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆126Updated 5 years ago
- A listen attend and spell reimplementation in tensorflow, using a custom attention mechanism.☆44Updated 5 years ago
- ☆48Updated 3 years ago
- parallel wavenet based on nsynth☆105Updated 5 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆50Updated 6 years ago
- Tensorflow implementation of Nvidia Waveglow☆41Updated 5 years ago
- Auto Segmentation Criterion (ASG) implemented in pytorch☆50Updated 2 years ago
- An implementation of Tacotron and Tacotron2☆81Updated 3 years ago
- Wavenet and its applications with Tensorflow☆56Updated 6 years ago
- Fast spectrogram phase recovery using Local Weighted Sums (C/Python/Matlab)☆110Updated 9 months ago
- An implementation of rnn transducer for sequence labeling problem☆22Updated 6 years ago
- Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"☆80Updated 2 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆111Updated 4 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 6 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 4 years ago
- an tutorial implement of voice conversion using pytorch☆35Updated 6 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆55Updated 6 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆136Updated 3 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆78Updated 4 years ago
- TTS model based on Transformer.☆57Updated 5 years ago
- Open Source WFST-based Decoder Toolkit☆75Updated 8 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 6 years ago
- Fast parallel RNN-Transducer.☆10Updated 4 years ago
- A system works on singing voice synthesis☆78Updated last year
- Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive)☆81Updated 4 years ago
- Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project utilizes input pipeline and estimator API …☆89Updated 5 years ago
- pytorch implementation of lyre.ai's char2wav model☆32Updated 7 years ago