arielephrat / vid2speech
Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17
☆116Updated 8 years ago
Alternatives and similar repositories for vid2speech
Users that are interested in vid2speech are comparing it to the libraries listed below
Sorting:
- Singing Voice Separation via Recurrent Inference and Skip-Filtering Connections - PyTorch Implementation. Demo:☆171Updated 6 years ago
- ☆71Updated 8 years ago
- ☆66Updated 8 years ago
- FFTNet vocoder implementation☆81Updated 6 years ago
- Torch implementation for audio neural style.☆140Updated 8 years ago
- A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆205Updated 6 years ago
- TensorFlow implementation of "SoundNet".☆145Updated 7 years ago
- A listen attend and spell reimplementation in tensorflow, using a custom attention mechanism.☆44Updated 6 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆123Updated 7 years ago
- Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman.☆51Updated 6 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB☆50Updated 6 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16☆22Updated 2 years ago
- A PyTorch implementation of fast-wavenet☆92Updated 7 years ago
- Tensorflow Implementation of Expressive Tacotron☆196Updated 6 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 5 years ago
- Cross-lingual Voice Conversion☆97Updated 7 years ago
- ☆64Updated 6 years ago
- A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆92Updated 6 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆85Updated 4 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆109Updated 6 years ago
- Audio-Visual Speech Recognition using Deep Learning☆60Updated 6 years ago
- It is a Tutorial, not a complete implement☆55Updated 6 years ago
- The code for the MaD TwinNet. Demo page:☆111Updated 2 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 5 years ago
- Speech Recognition Using Tacotron☆163Updated 7 years ago
- Torch code for using Residual Networks with LSTMs for Lipreading☆98Updated 6 years ago
- Code to demonstrate multimodal LSTM☆36Updated last year
- ☆40Updated 6 years ago
- MSc AI Project on generative deep networks and neural style transfer for audio