klauscc / lipnet-replication
A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading
☆28Updated 7 years ago
Alternatives and similar repositories for lipnet-replication:
Users that are interested in lipnet-replication are comparing it to the libraries listed below
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 4 years ago
- A Layered Memory Network for MovieQA☆16Updated 6 years ago
- Attempts to understand deep learning and the Tensorflow RNN api by implementing a (very)crude version of the google DeViSE paper(2013).☆7Updated 8 years ago
- ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and Ex…☆26Updated 6 years ago
- https://arxiv.org/abs/1707.00836☆21Updated 7 years ago
- "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆68Updated 5 years ago
- Representations of language in a model of visually grounded speech signal.☆23Updated 6 years ago
- A tensorflow implementation of Wide Residual Networks(https://arxiv.org/abs/1605.07146)☆21Updated 6 years ago
- Fast-Slow Recurrent Neural Networks☆14Updated 7 years ago
- Egocentric Video Description based on Temporally-Linked Sequences☆11Updated 7 years ago
- ☆17Updated 7 years ago
- Torch implementation for Stacked Attention Networks☆23Updated 8 years ago
- Attention Bidirectional Video Recurrent Net☆56Updated 6 years ago
- My experiments in lip reading using deep learning with the LRW dataset☆51Updated 4 years ago
- code for triplet GAN☆31Updated 6 years ago
- Keras Implementation of "Look, Listen and Learn" Model☆21Updated 7 years ago
- We propose a new variant GAN model to deal with image generation and transformation,especially in facial attributes area.☆12Updated 7 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)☆43Updated 7 years ago
- A PyTorch implementation of DenseNet, supporting multiclass and multilabel classification.☆24Updated 7 years ago
- Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture (AAAI-18)☆32Updated 6 years ago
- ☆15Updated 7 years ago
- The source code for Temporal Attention-Gated Model.☆21Updated 7 years ago
- The code for shuttleNet.☆31Updated 7 years ago
- A simplistic web app for annotating emotions in human speech video recordings.☆28Updated 10 years ago
- Label Distribution Learning Forest☆30Updated 7 years ago
- Implementation for the CVPR2019 paper "Graphical Contrastive Losses for Scene Graph Parsing"☆12Updated 5 years ago
- ☆29Updated 7 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Updated 7 years ago
- Deep Learning for Video Retrieval by Natural Language☆11Updated 5 years ago
- code for Emotion Recognition in the Wild (EmotiW) challenge☆38Updated 6 years ago