voletiv / syncnet-in-keras
Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman.
☆51 · Updated 6 years ago
Alternatives and similar repositories for syncnet-in-keras
Users who are interested in syncnet-in-keras are comparing it to the libraries listed below.
- AVSpeech downloader ☆67 · Updated 6 years ago
- Audio-Visual Speech Recognition using Deep Learning ☆60 · Updated 6 years ago
- Speech-conditioned face generation using Generative Adversarial Networks ☆88 · Updated 2 years ago
- Processing and extraction of face and mouth image files from the TCDTIMIT database ☆45 · Updated 4 years ago
- Speech recognition model based on a FAIR research paper, built using PyTorch. ☆84 · Updated 6 years ago
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3 ☆104 · Updated last year
- Mirror of the VoxCeleb dataset, a large-scale speaker identification dataset ☆71 · Updated 5 years ago
- You Said That?: Synthesising Talking Faces from Audio ☆69 · Updated 7 years ago
- Includes some core functions and a model to handle speech separation ☆155 · Updated 3 years ago
- Looking to listen at cocktail party ☆36 · Updated 2 years ago
- TensorFlow implementation of Expressive Tacotron ☆196 · Updated 6 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09… ☆61 · Updated 6 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments ☆108 · Updated last year
- A fast CNN-based vocoder ☆78 · Updated 4 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models ☆82 · Updated 4 years ago
- Live demo for speech emotion recognition using Keras and TensorFlow models ☆39 · Updated 10 months ago
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild" ☆102 · Updated 5 years ago
- Python toolkit for Visual Speech Recognition ☆37 · Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020) ☆81 · Updated 3 years ago
- PyTorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp… ☆28 · Updated 5 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University. ☆74 · Updated 5 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision ☆160 · Updated 5 years ago
- VoxCeleb1 i-vector based speaker recognition system ☆43 · Updated 7 years ago
- Tensor2tensor experiment with SpecAugment ☆46 · Updated 6 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an… ☆46 · Updated 6 years ago
- Torch code for using Residual Networks with LSTMs for Lipreading ☆98 · Updated 6 years ago
- An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" … ☆114 · Updated 4 years ago