klauscc / lipnet-replication
A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading
☆28Updated 7 years ago
Alternatives and similar repositories for lipnet-replication:
Users who are interested in lipnet-replication are comparing it to the repositories listed below
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆21Updated 5 years ago
- "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆67Updated 5 years ago
- Fast-Slow Recurrent Neural Networks☆14Updated 6 years ago
- https://arxiv.org/abs/1707.00836☆22Updated 7 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).☆11Updated 2 years ago
- A Layered Memory Network for MovieQA☆16Updated 6 years ago
- We propose a new GAN variant for image generation and transformation, especially in the area of facial attributes.☆12Updated 7 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)☆42Updated 7 years ago
- Attention Bidirectional Video Recurrent Net☆56Updated 5 years ago
- My experiments in lip reading using deep learning with the LRW dataset☆51Updated 3 years ago
- ☆17Updated 6 years ago
- The source code for Temporal Attention-Gated Model.☆21Updated 7 years ago
- Project Uncovering Temporal Context for Video Question and Answering☆15Updated 8 years ago
- A lip-reading neural network using LSTM, implemented in Keras☆17Updated 8 years ago
- Co-attending Regions and Detections for VQA.☆41Updated 6 years ago
- Person Recognition System on PIPA dataset☆29Updated 2 years ago
- The implementation of the 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 6 years ago
- Label Distribution Learning Forest☆30Updated 7 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 4 years ago
- Deep Learning for Video Retrieval by Natural Language☆11Updated 5 years ago
- Rethinking the Form of Latent States in Image Captioning☆21Updated 6 years ago
- The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild☆25Updated 6 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Updated 8 years ago
- Using an LSTM and 4d convolutional network for lip reading☆12Updated 6 years ago
- Implementation for the CVPR2019 paper "Graphical Contrastive Losses for Scene Graph Parsing"☆12Updated 5 years ago
- A list of recent papers on transfer learning☆24Updated 7 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Updated 6 years ago