jeyajseo / MASN-pytorchLinks

pytorch implementation for the paper Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering

☆2

Alternatives and similar repositories for MASN-pytorch

Users that are interested in MASN-pytorch are comparing it to the libraries listed below

Sorting:

thaolmk54 / hcrn-videoqa
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
☆133Updated last year
SunDoge / L-GCN
PyTorch implementation of L-GCN [https://arxiv.org/abs/2008.09105]
☆25Updated 4 years ago
vsislab / Controllable_XGating
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
☆68Updated 5 years ago
fanchenyou / HME-VideoQA
Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA
☆54Updated 3 years ago
ikuinen / CMIN_moment_retrieval
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
☆86Updated 4 years ago
noagarcia / knowit-rock
ROCK model for Knowledge-Based VQA in Videos
☆30Updated 4 years ago
aioz-ai / ICCV19_VQA-CTI
Compact Trilinear Interaction for Visual Question Answering (ICCV 2019)
☆38Updated 2 years ago
yytzsy / SCDM
Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
☆71Updated 3 years ago
SydCaption / SAAT
☆62Updated 4 years ago
yytzsy / ABLR_code
The source code of the paper: "To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression"
☆29Updated 6 years ago
XgDuan / WSDEC
Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…
☆104Updated 5 years ago
mrsalehi / ground-sentence-video
Implementation of the EMNLP 2018 paper "Temporally Grounding Natural Sentence in Video" using PyTorch
☆2Updated 2 years ago
niluthpol / weak_supervised_video_moment
Weakly Supervised Video Moment Retrieval from Text Queries
☆43Updated 5 years ago
Sha-Lab / CMHSE
The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
☆16Updated 6 years ago
zfchenUnique / WSSTG
This repository contains the main baselines introduced in WSSTG (ACL 2019).
☆55Updated last year
YiwuZhong / Sub-GC
[ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"
☆97Updated 11 months ago
escorciav / moments-retrieval-page
Moments Retrieval Project Webpage (temporal)
☆31Updated last year
dazhang-cv / MAN
This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment"
☆17Updated 6 years ago
crodriguezo / TMLGA
Repository of proposal-free temporal moment localization work
☆33Updated last year
tgc1997 / RMN
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79Updated 4 years ago
JonghwanMun / LGI4temporalgrounding
Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"
☆131Updated 4 years ago
devtrace404 / Video-Captioning-Using-Object-Trajectory-Features
Video Captioning on MSR-VTT and MSVD dataset using Deep Learning
☆21Updated 4 years ago
jayleicn / TVQAplus
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
☆129Updated 2 years ago
ttengwang / dense-video-captioning-pytorch
Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)
☆75Updated 3 years ago
youngfly11 / LCMCG-PyTorch
AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"
☆58Updated 3 years ago
sunnychencool / AOQ
Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)
☆34Updated 5 years ago
StanfordVL / STGraph
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23Updated 5 years ago
daqingliu / NMTree
Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)
☆39Updated 5 years ago
hobincar / SGN
Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54Updated 4 years ago
LuoweiZhou / anet2016-cuhk-feature
Feature Extraction Toolbox from CUHK&ETHZ&SIAT submission to ActivityNet 2016
☆32Updated 6 years ago