voletiv / syncnet-in-keras
Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman.
☆51Updated 5 years ago
Alternatives and similar repositories for syncnet-in-keras:
Users that are interested in syncnet-in-keras are comparing it to the libraries listed below
- Audio-Visual Speech Recognition using Deep Learning☆60Updated 6 years ago
- AVSpeech downloader☆66Updated 5 years ago
- Looking to listen at cocktail party☆36Updated last year
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆69Updated 5 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆44Updated 4 years ago
- A fast cnn-based vocoder☆78Updated 4 years ago
- ☆54Updated 6 years ago
- Python toolkit for Visual Speech Recognition☆37Updated 4 years ago
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆99Updated 10 months ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated 5 months ago
- Tensorflow Implementation of Expressive Tacotron☆197Updated 6 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.☆83Updated 6 years ago
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments☆43Updated 3 years ago
- Paper: https://arxiv.org/abs/1702.02285☆63Updated 6 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 6 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆82Updated 4 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆79Updated 3 years ago
- Re-implementation the code used in Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder☆146Updated 5 years ago
- Include some core functions and model to handle speech separation☆155Updated 3 years ago
- Speech Enhancement using Bayesian WaveNet☆96Updated 6 years ago
- You Said That?: Synthesising Talking Faces from Audio☆69Updated 6 years ago
- Pytorch implementation of Generalized End-to-End Loss for speaker verification☆83Updated 5 years ago
- Deep understanding and modelling of the hierarchical structure of prosody☆22Updated 5 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- Torch code for using Residual Networks with LSTMs for Lipreading☆99Updated 6 years ago
- The pytorch implementation of DC-TTS☆76Updated 6 years ago