klauscc / lipnet-replication
A replication of Google DeepMind's paper "End-to-End Sentence-level Lipreading"
☆27Updated 6 years ago
Related projects:
- A Layered Memory Network for MovieQA☆16Updated 6 years ago
- Code examples to learn how to use tensorflow☆15Updated 8 years ago
- https://arxiv.org/abs/1707.00836☆22Updated 6 years ago
- The source code for Temporal Attention-Gated Model.☆20Updated 7 years ago
- [CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog☆33Updated 3 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 3 years ago
- ☆17Updated 6 years ago
- ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and Ex…☆26Updated 5 years ago
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction☆20Updated last year
- Seminar: intro to deep learning with tensorflow☆13Updated 7 years ago
- 4th place solution to Google Cloud & YouTube-8M Video Understanding Challenge☆26Updated 7 years ago
- Fast-Slow Recurrent Neural Networks☆14Updated 6 years ago
- Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture (AAAI-18)☆32Updated 6 years ago
- A tensorflow implementation of Wide Residual Networks (https://arxiv.org/abs/1605.07146)☆21Updated 6 years ago
- code for triplet GAN☆31Updated 6 years ago
- Contains code for the EMNLP paper "Learning Linguistic Attributes for Zero-Shot Verb Classification"☆27Updated 6 years ago
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆33Updated 5 years ago
- We propose a new GAN variant for image generation and transformation, especially for facial attributes.☆12Updated 6 years ago
- Egocentric Video Description based on Temporally-Linked Sequences☆11Updated 7 years ago
- Good practices in VQA systems, such as POS-tag attention, structured triplet learning, and triplet attention, are very general and can be…☆20Updated 6 years ago
- Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang☆20Updated 8 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Updated 7 years ago
- tf2.0 implementation of circle loss☆32Updated 4 years ago
- ☆26Updated this week
- Project Uncovering Temporal Context for Video Question and Answering☆15Updated 8 years ago
- a list of recent papers on transfer learning☆24Updated 6 years ago
- ☆17Updated this week
- Converts TensorFlow checkpoints (with index, meta and data files) to PyTorch, HDF5 and JSON☆18Updated 3 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)☆42Updated 7 years ago
- Code release for paper "A Modulation Module for Multi-task Learning with Applications in Image Retrieval"☆32Updated 5 years ago