ajseo17 / MASN-pytorch

PyTorch implementation for the paper "Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering"

☆2

Alternatives and similar repositories for MASN-pytorch

Users who are interested in MASN-pytorch are comparing it to the repositories listed below.

fanchenyou / HME-VideoQA
Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA
☆54 Updated 3 years ago
niluthpol / weak_supervised_video_moment
Weakly Supervised Video Moment Retrieval from Text Queries
☆43 Updated 4 years ago
noagarcia / knowit-rock
ROCK model for Knowledge-Based VQA in Videos
☆30 Updated 4 years ago
InterDigitalInc / DialogSummary-VideoQA
☆10 Updated 3 years ago
yytzsy / SCDM
Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
☆71 Updated 3 years ago
escorciav / moments-retrieval-page
Moments Retrieval Project Webpage (temporal)
☆31 Updated last year
SunDoge / L-GCN
PyTorch implementation of L-GCN [https://arxiv.org/abs/2008.09105]
☆25 Updated 4 years ago
yytzsy / ABLR_code
The source code of the paper: "To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression"
☆29 Updated 6 years ago
Sha-Lab / CMHSE
The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
☆16 Updated 6 years ago
zfchenUnique / WSSTG
This repository contains the main baselines introduced in WSSTG (ACL 2019).
☆55 Updated 10 months ago
ikuinen / CMIN_moment_retrieval
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
☆86 Updated 4 years ago
StanfordVL / STGraph
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23 Updated 5 years ago
TheShadow29 / vognet-pytorch
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
☆67 Updated 4 years ago
dazhang-cv / MAN
This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment"
☆17 Updated 6 years ago
vsislab / Controllable_XGating
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
☆67 Updated 5 years ago
aioz-ai / ICCV19_VQA-CTI
Compact Trilinear Interaction for Visual Question Answering (ICCV 2019)
☆38 Updated 2 years ago
crodriguezo / TMLGA
Repository of proposal-free temporal moment localization work
☆33 Updated 11 months ago
jayleicn / TVQAplus
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
☆129 Updated 2 years ago
devtrace404 / Video-Captioning-Using-Object-Trajectory-Features
Video Captioning on MSR-VTT and MSVD dataset using Deep Learning
☆21 Updated 4 years ago
mrsalehi / ground-sentence-video
Implementation of the EMNLP 2018 paper "Temporally Grounding Natural Sentence in Video" using PyTorch
☆2 Updated 2 years ago
maurya-rohit / Scene-Graph-For-Videos
☆15 Updated 9 months ago
SydCaption / SAAT
☆62 Updated 4 years ago
salesforce / BiST
Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)
☆11 Updated 3 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34 Updated 5 years ago
tanghaoyu258 / ACRM-for-moment-retrieval
☆27 Updated 2 years ago
26hzhang / ReLoCLNet
Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)
☆58 Updated 3 years ago
XgDuan / WSDEC
Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…
☆104 Updated 5 years ago
noagarcia / ROLL-VideoQA
PyTorch code for ROLL, a knowledge-based video story question answering model.
☆21 Updated 4 years ago
sunnychencool / AOQ
Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)
☆34 Updated 4 years ago
thaolmk54 / hcrn-videoqa
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
☆133 Updated 10 months ago