klauscc / lipnet-replication
A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading
☆27Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for lipnet-replication
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 4 years ago
- [CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog☆34Updated 3 years ago
- Representations of language in a model of visually grounded speech signal.☆23Updated 6 years ago
- https://arxiv.org/abs/1707.00836☆22Updated 7 years ago
- Attention Bidirectional Video Recurrent Net☆56Updated 5 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)☆42Updated 7 years ago
- We propose a new variant GAN model to deal with image generation and transformation,especially in facial attributes area.☆12Updated 7 years ago
- ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and Ex…☆26Updated 5 years ago
- A simplistic web app for annotating emotions in human speech video recordings.☆27Updated 10 years ago
- Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang☆20Updated 8 years ago
- ☆29Updated 7 years ago
- "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆65Updated 5 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆21Updated 5 years ago
- A Layered Memory Network for MovieQA☆16Updated 6 years ago
- A Lip Reading Neural Network using LSTM, implemented upon keras☆17Updated 8 years ago
- Egocentric Video Description based on Temporally-Linked Sequences☆11Updated 7 years ago
- The source code for Temporal Attention-Gated Model.☆21Updated 7 years ago
- convenience utilities for model validation☆23Updated 5 years ago
- My experiments in lip reading using deep learning with the LRW dataset☆51Updated 3 years ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 6 years ago
- code for Emotion Recognition in the Wild (EmotiW) challenge☆37Updated 5 years ago
- Torch implementation for Stacked Attention Networks☆24Updated 7 years ago
- Code to accompany the paper "Learning Grimaces By Watching TV" and FaceValue dataset☆12Updated 6 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆20Updated last year
- Fast-Slow Recurrent Neural Networks☆14Updated 6 years ago
- a list of recent papers on transfer learning☆24Updated 6 years ago
- Various implementations and experimentation for deep neural network model compression☆24Updated 6 years ago