yqf-oo / videoqa-stan
Video Question Answering via Hierarchical Spatio-Temporal Attention Networks
☆9Updated 7 years ago
Alternatives and similar repositories for videoqa-stan:
Users that are interested in videoqa-stan are comparing it to the libraries listed below
- Lua☆57Updated 6 years ago
- videoqa,天池江之杯视频问答比赛☆13Updated 6 years ago
- SelfCriticalSequenceTrainingforImageCaptioning☆21Updated 7 years ago
- Unifying the Video and Question Attentions for Open-Ended Video Question Answering☆21Updated 5 years ago
- Code base for ZJL zero shot learning competition.☆50Updated 6 years ago
- This project is out of date, I don't remember the details inside...☆84Updated 7 years ago
- the source code of Multi-modal Circulant Fusion (MCF) for Temporal Activity Localization☆23Updated 5 years ago
- ☆83Updated 4 years ago
- Semantically consistent regularizer for zero-shot learning☆64Updated 7 years ago
- A universal and efficient framework for training well-performing light net☆124Updated 7 years ago
- Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval☆69Updated 5 years ago
- Contrastive Learning for Image Captioning☆50Updated 6 years ago
- Tensorflow implementation of Dual Attention Network☆20Updated 7 years ago
- Repository for image caption for Chinese☆28Updated 7 years ago
- A Layered Memory Network for MovieQA☆16Updated 6 years ago
- Stack-Captioning: Coarse-to-Fine Learning for Image Captioning☆62Updated 6 years ago
- Video to Language Challenge (MSR-VTT Challenge 2016)☆31Updated 7 years ago
- Structured Attentions for Visual Question Answering☆46Updated 6 years ago
- Source code for "Recurrent Fusion Network for Image Captioning".☆23Updated 6 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018)☆36Updated 6 years ago
- Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"☆29Updated 6 years ago
- Temporal Context Network for Activity Localization in Videos☆31Updated 7 years ago
- Implementation for our paper "Conditional Image-Text Embedding Networks"☆38Updated 4 years ago
- Reimplementation for Iterative Visual Reasoning Beyond Convolutions(CVPR2018),i've reimplemented it on pytorch according to [endernewton/…☆71Updated 6 years ago
- Code for NIPS 2018 paper, "Chain of Reasoning for Visual Question Answering"☆28Updated 6 years ago
- video captioning☆24Updated 5 years ago
- Co-attending Regions and Detections for VQA.☆40Updated 6 years ago
- ☆92Updated 7 years ago
- Code for AI Challenger contest. (Generating chinese image captions)☆213Updated 6 years ago
- Code for "Video Re-localization" in ECCV 2018☆80Updated 5 years ago