WindQAQ / listen-attend-and-spellLinks
Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project utilizes input pipeline and estimator API of Tensorflow, which makes the training and evaluation truly end-to-end.
☆89Updated 6 years ago
Alternatives and similar repositories for listen-attend-and-spell
Users that are interested in listen-attend-and-spell are comparing it to the libraries listed below
Sorting:
- ASR with PyTorch☆139Updated 6 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆139Updated 4 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆122Updated 5 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆126Updated 6 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Updated 4 years ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Updated 7 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆174Updated 4 years ago
- A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.☆200Updated 6 years ago
- A Fast Sequence Transducer Implementation with PyTorch Bindings☆198Updated 2 years ago
- An LDA/PLDA estimator using KALDI in python for speaker verification tasks☆100Updated 8 years ago
- transformer for ASR-systerm (via tensorflow2.0)☆114Updated 6 years ago
- Tensorflow implementation of x-vector topology on top of Kaldi recipe☆119Updated 5 years ago
- A listen attend and spell reimplementation in tensorflow, using a custom attention mechanism.☆44Updated 6 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.☆84Updated 6 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆125Updated 6 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Updated 5 years ago
- Parallel WaveNet Vocoder Based on ClariNet☆145Updated 6 years ago
- ☆45Updated 6 years ago
- A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition☆235Updated 5 years ago
- experiments with RETURNN☆158Updated 3 weeks ago
- INTERSPEECH 2019 Tutorial Materials☆193Updated 4 years ago
- Seq2Seq Speech Recognition with Transformer on Mandarin Chinese☆116Updated 5 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆102Updated 6 years ago
- A pytorch implementation of xvector embedding☆79Updated 5 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆32Updated 5 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 6 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated 2 years ago
- CUDA-Warp RNN-Transducer☆212Updated 2 years ago
- PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)☆81Updated 7 years ago