klauscc / lipnet-replication
A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading
☆28 Updated 7 years ago
Alternatives and similar repositories for lipnet-replication:
Users who are interested in lipnet-replication compare it to the repositories listed below.
- Fast-Slow Recurrent Neural Networks ☆14 Updated 7 years ago
- Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture (AAAI-18) ☆32 Updated 6 years ago
- https://arxiv.org/abs/1707.00836 ☆21 Updated 7 years ago
- Project Uncovering Temporal Context for Video Question and Answering ☆14 Updated 9 years ago
- ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and Ex… ☆26 Updated 6 years ago
- ☆29 Updated 8 years ago
- Emotiw2017 code ☆15 Updated 7 years ago
- Attention Bidirectional Video Recurrent Net ☆56 Updated 6 years ago
- A Lip Reading Neural Network using LSTM, implemented upon keras ☆17 Updated 9 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019) ☆20 Updated 5 years ago
- ☆59 Updated 7 years ago
- Decoupled Learning for Conditional Adversarial Networks ☆17 Updated 6 years ago
- Pytorch implementation of pixel level domain transfer ☆9 Updated 6 years ago
- Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A… ☆23 Updated 5 years ago
- Representations of language in a model of visually grounded speech signal. ☆23 Updated 7 years ago
- Attempts to understand deep learning and the Tensorflow RNN api by implementing a (very) crude version of the Google DeViSE paper (2013). ☆7 Updated 8 years ago
- ☆9 Updated 8 years ago
- Egocentric Video Description based on Temporally-Linked Sequences ☆11 Updated 7 years ago
- tf2.0 implementation of circle loss ☆32 Updated 5 years ago
- The source code for Temporal Attention-Gated Model. ☆21 Updated 7 years ago
- a list of recent papers on transfer learning ☆24 Updated 7 years ago
- ☆13 Updated 7 years ago
- Transfer Learning via Unsupervised Task Discovery for Visual Question Answering ☆19 Updated 6 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements". ☆14 Updated 4 years ago
- Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang ☆19 Updated 8 years ago
- Deep Learning for Video Retrieval by Natural Language ☆11 Updated 5 years ago
- A tensorflow implementation of ByteNet with layer masking. ☆10 Updated 7 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf). ☆10 Updated 2 years ago
- Mostly for using the trained weights from https://github.com/ryankiros/visual-semantic-embedding in Keras ☆20 Updated 9 years ago
- Implementation for the CVPR2019 paper "Graphical Contrastive Losses for Scene Graph Parsing" ☆12 Updated 5 years ago