yl3800/IGV

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yl3800/IGV)

yl3800 / IGV

This repo contains code for Invariant Grounding for Video Question Answering

☆27

Alternatives and similar repositories for IGV

Users that are interested in IGV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yl3800 / EIGV
View on GitHub
☆15Aug 12, 2022Updated 3 years ago
doc-doc / NExT-QA
View on GitHub
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
☆189Aug 2, 2025Updated 11 months ago
doc-doc / NExT-OE
View on GitHub
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
☆30Jul 18, 2023Updated 3 years ago
sail-sg / VGT
View on GitHub
Video Graph Transformer for Video Question Answering (ECCV'22)
☆49Jun 8, 2023Updated 3 years ago
doc-doc / NExT-GQA
View on GitHub
Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)
☆89Jul 1, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
thaolmk54 / hcrn-videoqa
View on GitHub
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
☆135Jul 25, 2024Updated 2 years ago
SunDoge / L-GCN
View on GitHub
PyTorch implementation of L-GCN [https://arxiv.org/abs/2008.09105]
☆25Apr 25, 2021Updated 5 years ago
madeleinegrunde / AGQA_baselines_code
View on GitHub
☆18Nov 1, 2023Updated 2 years ago
ByZ0e / Glance-Focus
View on GitHub
This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)
☆31Jun 28, 2024Updated 2 years ago
csbobby / STAR_Benchmark
View on GitHub
☆36Apr 18, 2024Updated 2 years ago
NJUPT-MCC / DualVGR-VideoQA
View on GitHub
Implementation for the journal paper "DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering" (Jianyu et al., IEEE Tran…
☆18Jun 22, 2021Updated 5 years ago
nguyentthong / video-language-understanding
View on GitHub
[ACL’24 Findings] Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
☆49May 12, 2026Updated 2 months ago
Trunpm / TPT-for-VideoQA
View on GitHub
☆19Nov 25, 2022Updated 3 years ago
antoyang / just-ask
View on GitHub
[ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
☆127Sep 29, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
showlab / mist
View on GitHub
☆37Dec 20, 2023Updated 2 years ago
luohongyin / PILM
View on GitHub
Language model with phrase induction
☆14Jun 13, 2019Updated 7 years ago
rxtan2 / video-grounding-narrations
View on GitHub
☆12Mar 12, 2023Updated 3 years ago
bcmi / Causal-VidQA
View on GitHub
[CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The…
☆78Jun 23, 2025Updated last year
Yui010206 / SeViLA
View on GitHub
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
☆198Jan 14, 2024Updated 2 years ago
acharkq / Training-Free-Graph-Matching
View on GitHub
Source code of "Training Free Graph Neural Networks for Graph Matching"
☆12Jul 9, 2022Updated 4 years ago
StanLei52 / TQVSR
View on GitHub
[Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant
☆24Sep 11, 2023Updated 2 years ago
zhousheng97 / ViTXT-GQA
View on GitHub
[IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering
☆17Feb 16, 2026Updated 5 months ago
hughplay / TVR
View on GitHub
Transformation Driven Visual Reasoning - CVPR 2021
☆36May 27, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Annusha / LIReC
View on GitHub
Learning Interactions and Relationships between Movie Characters (CVPR'20)
☆22Apr 12, 2023Updated 3 years ago
ZijiaLewisLu / CVPR2025-DeCafNet
View on GitHub
Official Repo for CVPR 2025 Paper -- DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos
☆17Mar 16, 2026Updated 4 months ago
zchoi / VCRN
View on GitHub
☆11Jul 11, 2023Updated 3 years ago
MichiganNLP / In-the-wild-QA
View on GitHub
In-the-wild Question Answering
☆15May 10, 2023Updated 3 years ago
doc-doc / vRGV
View on GitHub
Visual Relation Grounding in Videos (ECCV'20, Spotlight)
☆57Dec 8, 2022Updated 3 years ago
waxnkw / IETrans-SGG.pytorch
View on GitHub
This is the code of ECCV 2022 (Oral) paper "Fine-Grained Scene Graph Generation with Data Transfer".
☆103Jan 24, 2023Updated 3 years ago
YYJMJC / Compositional-Temporal-Grounding
View on GitHub
☆31Mar 24, 2022Updated 4 years ago
jochemloedeman / PGN
View on GitHub
Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M…
☆44Sep 11, 2024Updated last year
WillSuen / GANs
View on GitHub
MXNet implementation of infoGAN, WGAN, CycleGAN
☆10Jan 28, 2018Updated 8 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
zhousheng97 / EgoTextVQA
View on GitHub
[CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
☆52Jun 19, 2025Updated last year
Zhiquan-Wen / D-VQA
View on GitHub
PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)
☆26Oct 13, 2022Updated 3 years ago
chenbinghui1 / ECAML
View on GitHub
AAAI2019
☆13Jan 22, 2019Updated 7 years ago
Oneplus / ELMo
View on GitHub
☆10May 20, 2019Updated 7 years ago
showlab / GEB-Plus
View on GitHub
[ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval
☆17Aug 24, 2022Updated 3 years ago
sam575 / axial-gan
View on GitHub
Code for "Simultaneous Face Hallucination and Translation for Thermal to Visible Face Verification using Axial-GAN"
☆15Apr 15, 2021Updated 5 years ago
YirongMao / COSONet
View on GitHub
The source code for the paper: Yirong Mao, Ruiping Wang, Shiguang Shan, Xilin Chen. COSONet: Compact Second-Order Network for Video Face …
☆12Dec 27, 2018Updated 7 years ago