voletiv / syncnet-in-keras
Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman.
☆51Updated 6 years ago
Alternatives and similar repositories for syncnet-in-keras
Users that are interested in syncnet-in-keras are comparing it to the libraries listed below
Sorting:
- AVSpeech downloader☆67Updated 6 years ago
- Tensorflow Implementation of Expressive Tacotron☆196Updated 6 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 6 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆71Updated 5 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.☆83Updated 6 years ago
- A fast cnn-based vocoder☆78Updated 4 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Updated 4 years ago
- Audio-Visual Speech Recognition using Deep Learning☆60Updated 6 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆107Updated last year
- Include some core functions and model to handle speech separation☆155Updated 3 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆239Updated 4 years ago
- Looking to listen at cocktail party☆36Updated 2 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆45Updated 4 years ago
- Code to train and run Blow☆143Updated 5 years ago
- The pytorch implementation of DC-TTS☆76Updated 6 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Updated 4 years ago
- Implementation of GAN architectures for Voice Conversion☆51Updated 6 years ago
- ☆56Updated 6 years ago
- Speech-conditioned face generation using Generative Adversarial Networks☆88Updated 2 years ago
- ☆54Updated 7 years ago
- Speech Enhancement using Bayesian WaveNet☆96Updated 7 years ago
- Network specification and demo☆35Updated 7 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 5 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆82Updated 4 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆46Updated 4 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆73Updated 5 years ago
- You Said That?: Synthesising Talking Faces from Audio☆69Updated 7 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆125Updated 6 years ago
- Python toolkit for Visual Speech Recognition☆37Updated 4 years ago
- Deep Convolution Text to Speech☆35Updated 7 years ago