xudejing/video-question-answering

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xudejing/video-question-answering)

xudejing / video-question-answering

Video Question Answering via Gradually Refined Attention over Appearance and Motion

☆178

Alternatives and similar repositories for video-question-answering

Users that are interested in video-question-answering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thaolmk54 / hcrn-videoqa
View on GitHub
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
☆135Jul 25, 2024Updated last year
YunseokJANG / tgif-qa
View on GitHub
Repository for our CVPR 2017 and IJCV: TGIF-QA
☆180Sep 6, 2021Updated 4 years ago
MILVLG / activitynet-qa
View on GitHub
An VideoQA dataset based on the videos from ActivityNet
☆94Nov 22, 2020Updated 5 years ago
ZJULearning / videoqa
View on GitHub
Unifying the Video and Question Attentions for Open-Ended Video Question Answering
☆22Jun 17, 2019Updated 7 years ago
fanchenyou / HME-VideoQA
View on GitHub
Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA
☆55Sep 13, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
jayleicn / TVQA
View on GitHub
[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering
☆181Oct 25, 2022Updated 3 years ago
jayleicn / TVQAplus
View on GitHub
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
☆132Oct 25, 2022Updated 3 years ago
SummerRaining / videoqa_keras
View on GitHub
videoqa,天池江之杯视频问答比赛
☆13Dec 19, 2018Updated 7 years ago
yj-yu / lsmdc
View on GitHub
☆33Nov 12, 2018Updated 7 years ago
VRU-NExT / VideoQA
View on GitHub
☆104Oct 19, 2022Updated 3 years ago
princetonvisualai / SPICE-U
View on GitHub
☆11Sep 7, 2020Updated 5 years ago
doc-doc / NExT-QA
View on GitHub
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
☆189Aug 2, 2025Updated 11 months ago
ZJULearning / TreeAttention
View on GitHub
A Better Way to Attend: Attention with Trees for Video Question Answering
☆25Mar 25, 2019Updated 7 years ago
noagarcia / knowit-rock
View on GitHub
ROCK model for Knowledge-Based VQA in Videos
☆31Oct 19, 2020Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
antoyang / just-ask
View on GitHub
[ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
☆127Sep 29, 2023Updated 2 years ago
NJUPT-MCC / DualVGR-VideoQA
View on GitHub
Implementation for the journal paper "DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering" (Jianyu et al., IEEE Tran…
☆18Jun 22, 2021Updated 5 years ago
tsujuifu / pytorch_violet
View on GitHub
A PyTorch implementation of VIOLET
☆138Dec 17, 2023Updated 2 years ago
jayleicn / ClipBERT
View on GitHub
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730Aug 8, 2023Updated 2 years ago
jayleicn / VideoLanguageFuturePred
View on GitHub
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
☆52Aug 20, 2022Updated 3 years ago
Cadene / vqa.pytorch
View on GitHub
Visual Question Answering in Pytorch
☆733Dec 11, 2019Updated 6 years ago
Peratham / video2text.pytorch
View on GitHub
PyTorch implementation of video captioning
☆13Sep 24, 2017Updated 8 years ago
jokieleung / awesome-visual-question-answering
View on GitHub
A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Common…
☆672Jul 6, 2023Updated 3 years ago
facebookresearch / ActivityNet-Entities
View on GitHub
A Dataset for Grounded Video Description
☆165Jan 4, 2022Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
lixiangpengcs / Spatial-Temporal-Adaptive-Attention-for-Video-Captioning
View on GitHub
Extension of hLSTMat
☆19Apr 15, 2021Updated 5 years ago
rasoolfa / videocap
View on GitHub
Memory-augmented Attention Modelling for Videos
☆10Apr 24, 2017Updated 9 years ago
Share14 / ShareGemini
View on GitHub
☆32Jul 29, 2024Updated last year
SunDoge / L-GCN
View on GitHub
PyTorch implementation of L-GCN [https://arxiv.org/abs/2008.09105]
☆25Apr 25, 2021Updated 5 years ago
sutdcv / SUTD-TrafficQA
View on GitHub
[CVPR 2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
☆66Feb 9, 2026Updated 5 months ago
doc-doc / NExT-OE
View on GitHub
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
☆30Jul 18, 2023Updated 3 years ago
GeWu-Lab / MUSIC-AVQA
View on GitHub
MUSIC-AVQA, CVPR2022 (ORAL)
☆100Dec 30, 2022Updated 3 years ago
linjieli222 / VQA_ReGAT
View on GitHub
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
☆187Apr 15, 2021Updated 5 years ago
StanfordVL / STGraph
View on GitHub
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23Mar 4, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
coin-dataset / code
View on GitHub
☆48Apr 27, 2020Updated 6 years ago
henryhungle / MTN
View on GitHub
Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
☆100Oct 17, 2022Updated 3 years ago
intersun / LightningDOT
View on GitHub
source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT
☆72Nov 14, 2022Updated 3 years ago
showlab / all-in-one
View on GitHub
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
☆281Mar 25, 2023Updated 3 years ago
ronghanghu / lcgn
View on GitHub
Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019
☆92Aug 9, 2019Updated 6 years ago
agakshat / visualdialog-pytorch
View on GitHub
Community Regularization of Visually Grounded Dialog https://arxiv.org/abs/1808.04359
☆15May 16, 2019Updated 7 years ago
KaihuaTang / VQA2.0-Recent-Approachs-2018.pytorch
View on GitHub
A pytroch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures…
☆300Jan 6, 2026Updated 6 months ago