fanchenyou/HME-VideoQA

Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA

☆55

Alternatives and similar repositories for HME-VideoQA

Users who are interested in HME-VideoQA compare it to the repositories listed below.

thaolmk54 / hcrn-videoqa
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
☆135 · Jul 25, 2024 · Updated 2 years ago
YunseokJANG / tgif-qa
Repository for our CVPR 2017 and IJCV: TGIF-QA
☆180 · Sep 6, 2021 · Updated 4 years ago
jayleicn / TVQAplus
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
☆132 · Oct 25, 2022 · Updated 3 years ago
lixiangpengcs / PSAC
Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering
☆27 · Apr 15, 2021 · Updated 5 years ago
SunDoge / L-GCN
PyTorch implementation of L-GCN [https://arxiv.org/abs/2008.09105]
☆25 · Apr 25, 2021 · Updated 5 years ago
xudejing / video-question-answering
Video Question Answering via Gradually Refined Attention over Appearance and Motion
☆178 · Dec 5, 2017 · Updated 8 years ago
salesforce / BiST
Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP 2020)
☆11 · Jun 16, 2025 · Updated last year
ZJULearning / TreeAttention
A Better Way to Attend: Attention with Trees for Video Question Answering
☆25 · Mar 25, 2019 · Updated 7 years ago
noagarcia / knowit-rock
ROCK model for Knowledge-Based VQA in Videos
☆31 · Oct 19, 2020 · Updated 5 years ago
noagarcia / ROLL-VideoQA
PyTorch code for ROLL, a knowledge-based video story question answering model.
☆21 · Sep 29, 2020 · Updated 5 years ago
jayleicn / TVQA
[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering
☆181 · Oct 25, 2022 · Updated 3 years ago
SummerRaining / videoqa_keras
VideoQA, Tianchi Zhijiang Cup video question answering competition
☆13 · Dec 19, 2018 · Updated 7 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34 · May 14, 2020 · Updated 6 years ago
MILVLG / activitynet-qa
A VideoQA dataset based on the videos from ActivityNet
☆94 · Nov 22, 2020 · Updated 5 years ago
jokieleung / awesome-visual-question-answering
A curated list of Visual Question Answering (VQA) (Image/Video Question Answering), Visual Question Generation, Visual Dialog, Visual Common…
☆672 · Jul 6, 2023 · Updated 3 years ago
AmingWu / CCN
Connective Cognition Network for Directional Visual Commonsense Reasoning
☆15 · May 6, 2021 · Updated 5 years ago
chrisc36 / bottom-up-attention-vqa
BottomUpTopDown VQA model with question-type debiasing
☆22 · Oct 6, 2019 · Updated 6 years ago
cshizhe / hgr_v2t
Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".
☆211 · Jun 12, 2020 · Updated 6 years ago
jayleicn / VideoLanguageFuturePred
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
☆52 · Aug 20, 2022 · Updated 3 years ago
jialinwu17 / self_critical_vqa
Code for NeurIPS 2019 paper "Self-Critical Reasoning for Robust Visual Question Answering"
☆40 · Sep 9, 2019 · Updated 6 years ago
jayleicn / recurrent-transformer
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170 · Dec 4, 2020 · Updated 5 years ago
WuJie1010 / Awesome-Temporally-Language-Grounding
A curated list of “Temporally Language Grounding” and related areas
☆110 · Nov 28, 2019 · Updated 6 years ago
yj-yu / lsmdc
☆33 · Nov 12, 2018 · Updated 7 years ago
StanfordVL / STGraph
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23 · Mar 4, 2020 · Updated 6 years ago
GingL / ARN
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding
☆32 · Aug 29, 2019 · Updated 6 years ago
jhyuklee / dmn-pytorch
Re-implementation: Ask Me Anything: Dynamic Memory Networks for Natural Language Processing
☆14 · Apr 7, 2019 · Updated 7 years ago
NJUPT-MCC / DualVGR-VideoQA
Implementation for the journal paper "DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering" (Jianyu et al., IEEE Tran…
☆18 · Jun 22, 2021 · Updated 5 years ago
MDSKUL / MasterProject
Code for my Master's project on VideoBERT
☆39 · Nov 25, 2020 · Updated 5 years ago
zfchenUnique / WSSTG
This repository contains the main baselines introduced in WSSTG (ACL 2019).
☆57 · Jul 8, 2024 · Updated 2 years ago
linjieli222 / HERO
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆235 · Sep 16, 2021 · Updated 4 years ago
zilongzheng / visdial-gnn
PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations
☆42 · Jun 30, 2021 · Updated 5 years ago
facebookresearch / ActivityNet-Entities
A Dataset for Grounded Video Description
☆165 · Jan 4, 2022 · Updated 4 years ago
KaihuaTang / VQA2.0-Recent-Approachs-2018.pytorch
A PyTorch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures…
☆300 · Jan 6, 2026 · Updated 6 months ago
danieljf24 / dual_encoding
[CVPR 2019] Dual Encoding for Zero-Example Video Retrieval
☆153 · Jan 10, 2023 · Updated 3 years ago
princetonvisualai / SPICE-U
☆11 · Sep 7, 2020 · Updated 5 years ago
linjieli222 / VQA_ReGAT
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
☆187 · Apr 15, 2021 · Updated 5 years ago
wh0330 / CAG_VisDial
☆15 · Aug 13, 2020 · Updated 5 years ago
ronghanghu / lcgn
Code release for Hu et al., "Language-Conditioned Graph Networks for Relational Reasoning", ICCV 2019
☆92 · Aug 9, 2019 · Updated 6 years ago
tgc1997 / RMN
IJCAI 2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79 · Nov 23, 2020 · Updated 5 years ago