klauscc / lipnet-replication
A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading
☆28Updated 7 years ago
Alternatives and similar repositories for lipnet-replication:
Users who are interested in lipnet-replication are comparing it to the repositories listed below
- Attention Bidirectional Video Recurrent Net☆56Updated 6 years ago
- https://arxiv.org/abs/1707.00836☆21Updated 7 years ago
- Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture (AAAI-18)☆32Updated 6 years ago
- Solution for N+1 fish, N+2 fish DrivenData competition (2nd place)☆13Updated 5 years ago
- "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆68Updated 5 years ago
- Egocentric Video Description based on Temporally-Linked Sequences☆11Updated 7 years ago
- ☆17Updated 6 years ago
- Converts TensorFlow checkpoints (with index, meta and data files) to PyTorch, HDF5 and JSON☆18Updated 3 years ago
- Project Uncovering Temporal Context for Video Question and Answering☆14Updated 8 years ago
- ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and Ex…☆26Updated 6 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)☆42Updated 7 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 4 years ago
- The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild☆25Updated 6 years ago
- Interpretable Image Search by Priyam Tejaswin and Akshay Chawla☆22Updated 2 years ago
- We propose a new GAN variant for image generation and transformation, especially in the area of facial attributes.☆12Updated 7 years ago
- The source code for Temporal Attention-Gated Model.☆21Updated 7 years ago
- Fast-Slow Recurrent Neural Networks☆14Updated 7 years ago
- Attempts to understand deep learning and the TensorFlow RNN API by implementing a (very) crude version of the Google DeViSE paper (2013).☆7Updated 8 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆21Updated 5 years ago
- A Lip Reading Neural Network using LSTM, implemented in Keras☆17Updated 8 years ago
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆32Updated 5 years ago
- (new updates have been moved to a private repo)☆8Updated 7 years ago
- Lambda Networks implemented in PyTorch☆13Updated 4 years ago
- tf2.0 implementation of circle loss☆32Updated 4 years ago
- Code release for paper "A Modulation Module for Multi-task Learning with Applications in Image Retrieval"☆32Updated 6 years ago
- End to End Multiview Lip Reading☆10Updated 7 years ago
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction☆20Updated 2 years ago
- PyTorch implementation of pixel level domain transfer☆9Updated 6 years ago
- A Layered Memory Network for MovieQA☆16Updated 6 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆20Updated last year