artem179 / WLAS
A PyTorch implementation of the 'Watch, Listen, Attend and Spell' (WLAS) network, which learns to transcribe videos of mouth motion to characters.
☆11 Updated 7 years ago
Alternatives and similar repositories for WLAS:
Users who are interested in WLAS are comparing it to the repositories listed below:
- End to End Multiview Lip Reading ☆10 Updated 7 years ago
- VoxSRC Challenge ☆31 Updated 5 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification ☆10 Updated 6 years ago
- Sound augmentation using Large-scale audio dataset (Audioset) ☆44 Updated 3 years ago
- Dialect identification using Siamese network ☆15 Updated 7 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec… ☆43 Updated last year
- Tensor2tensor experiment with SpecAugment ☆46 Updated 5 years ago
- Time Delayed NN implemented in pytorch ☆81 Updated 8 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven… ☆52 Updated 6 years ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models. ☆20 Updated 5 years ago
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition ☆31 Updated 5 years ago
- Network specification and demo ☆35 Updated 7 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py… ☆33 Updated 6 years ago
- implement end-to-end asr algorithm with tensorflow ☆40 Updated 6 years ago
- PyTorch bindings for Warp-CTC ☆42 Updated 5 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp… ☆28 Updated 5 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2. ☆13 Updated 9 years ago
- Paper: https://arxiv.org/abs/1702.02285 ☆63 Updated 6 years ago
- FFTNet vocoder implementation ☆81 Updated 6 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis" ☆16 Updated 6 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi… ☆28 Updated 5 years ago
- ESPnet-TTS Audio Sample HP ☆21 Updated 5 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!! ☆86 Updated 4 years ago
- Mispronunciation detection code for jingju singing voice ☆20 Updated 6 years ago
- An implementation of Tacotron and Tacotron2 ☆81 Updated 3 years ago
- A packaged convolutional voice activity detector for noisy environments. ☆14 Updated 5 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation ☆19 Updated 6 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB ☆50 Updated 5 years ago
- Audio-Visual Speech Recognition using Deep Learning ☆60 Updated 6 years ago
- ☆31 Updated 6 years ago