aistairc / seq2seq_temporal_attention
Generating Video Description using Sequence-to-sequence Model with Temporal Attention
☆32 · Updated 5 years ago
Related projects
Alternatives and complementary repositories for seq2seq_temporal_attention
- PyTorch implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori, ECCV 2018 ☆172 · Updated 6 years ago
- Implementation for our paper "Phrase Localization and Visual Relationship Detection with Comprehensive Image-Language Cues." ☆39 · Updated 7 years ago
- ☆31 · Updated 6 years ago
- Localize objects in images using referring expressions ☆37 · Updated 8 years ago
- Code and demos for our paper at ACM MM 2017 ☆63 · Updated 5 years ago
- Supplementary material to "Top-down Visual Saliency Guided by Captions" (CVPR 2017) ☆107 · Updated 6 years ago
- Project Uncovering Temporal Context for Video Question and Answering ☆15 · Updated 8 years ago
- Here we describe a new approach to train a video captioning neural network, that is not only based on the normal cross entropy loss for … ☆8 · Updated 4 years ago
- Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture (AAAI-18) ☆32 · Updated 6 years ago
- Contains approaches introduced in the MovieQA benchmark dataset paper ☆80 · Updated 7 years ago
- Using Semantic Compositional Networks for Video Captioning ☆97 · Updated 5 years ago
- The implementation of Temporal Generative Adversarial Nets with Singular Value Clipping ☆78 · Updated 4 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017 ☆88 · Updated 6 years ago
- VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation ☆22 · Updated 7 years ago
- Attentive Semantic Video Generation using Captions ☆36 · Updated 7 years ago
- Repository for MarioQA: Answering Questions by Watching Gameplay Videos in ICCV 2017 ☆10 · Updated 7 years ago
- Implementation for "Joint Event Detection and Description in Continuous Video Streams" ☆22 · Updated 4 years ago
- ☆17 · Updated 6 years ago
- Attention Bidirectional Video Recurrent Net ☆56 · Updated 5 years ago
- Sentence/Caption evaluation using automated metrics ☆61 · Updated 8 years ago
- Small Flask-based apps to browse the Flickr30k dataset. ☆20 · Updated 7 years ago
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos" ☆34 · Updated 5 years ago
- Temporal augmentation with two-stream ConvNet features on human action recognition ☆18 · Updated 7 years ago
- Evaluation code for Dense-Captioning Events in Videos ☆124 · Updated 5 years ago
- Soft attention mechanism for video caption generation ☆156 · Updated 7 years ago
- ☆60 · Updated 6 years ago
- Image caption generation using Chainer ☆64 · Updated 5 years ago
- Novel Object Captioner - Captioning Images with diverse objects ☆41 · Updated 6 years ago