artem179 / WLAS
The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on pytorch.
☆11Updated 6 years ago
Alternatives and similar repositories for WLAS:
Users that are interested in WLAS are comparing it to the libraries listed below
- End to End Multiview Lip Reading☆10Updated 7 years ago
- Time Delayed NN implemented in pytorch☆80Updated 7 years ago
- Audio-Visual Speech Recognition using Deep Learning☆60Updated 6 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 5 years ago
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆31Updated 5 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Example implementation of Monotonic Chunkwise Attention.☆51Updated 6 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- FFTNet vocoder implementation☆81Updated 6 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 5 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Updated 6 years ago
- Python toolkit for Visual Speech Recognition☆37Updated 4 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- ESPnet-TTS Audio Sample HP☆21Updated 5 years ago
- ☆19Updated 6 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago