YangLiu9208/CMCIR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YangLiu9208/CMCIR)

YangLiu9208 / CMCIR

[IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering

☆20

Alternatives and similar repositories for CMCIR

Users that are interested in CMCIR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YangLiu9208 / JSRDA
View on GitHub
[IEEE T-CSVT 2019] Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition
☆14Nov 26, 2019Updated 6 years ago
YangLiu9208 / VisionGRU
View on GitHub
VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis
☆13Dec 26, 2024Updated last year
YangLiu9208 / SAKDN
View on GitHub
[IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition
☆29Jan 6, 2025Updated last year
HCPLab-SYSU / DDP-WM
View on GitHub
DDP-WM: Disentangled Dynamics Prediction for Efficient World Models (ICML-26)
☆19Mar 4, 2026Updated 4 months ago
HCPLab-SYSU / TAVP
View on GitHub
Learning to See and Act: Task-Aware Virtual View Exploration for Robotic Manipulation (CVPR-26)
☆25May 19, 2026Updated 2 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Sueqk / LMM-VQA
View on GitHub
LMM for VQA, tcsvt version
☆10Jul 19, 2024Updated 2 years ago
XLiu443 / Tem-adapter
View on GitHub
[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
☆37Oct 18, 2023Updated 2 years ago
HCPLab-SYSU / EXPRESS-Bench
View on GitHub
Embodied Question Answering (EQA) benchmark and method (ICCV 2025)
☆60Aug 12, 2025Updated 11 months ago
LZ-CH / DSPNet
View on GitHub
The official repository of [CVPR2025] DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering
☆28Apr 18, 2025Updated last year
sutdcv / SUTD-TrafficQA
View on GitHub
[CVPR 2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
☆66Feb 9, 2026Updated 5 months ago
xiaojino / RUArt
View on GitHub
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
☆10Nov 27, 2022Updated 3 years ago
ecoxial2007 / FGRW_MedVQA
View on GitHub
Fine-Grained Knowledge Fusion for Retrieval-Augmented Medical Visual Question
☆11Jul 18, 2024Updated 2 years ago
zwq2018 / Auto_star
View on GitHub
auto star for repo lists
☆10Aug 26, 2023Updated 2 years ago
zchoi / VCRN
View on GitHub
☆11Jul 11, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
AndersonStra / Mucko
View on GitHub
implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering
☆10Mar 17, 2022Updated 4 years ago
MichiganNLP / In-the-wild-QA
View on GitHub
In-the-wild Question Answering
☆15May 10, 2023Updated 3 years ago
Gary-code / KECVQG
View on GitHub
[ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"
☆10Sep 3, 2024Updated last year
JerryWisdom / react-vqa-master
View on GitHub
基于 React + router + redux + axios 和 Flask + MySQL + Pytorch 的视觉问答管理系统
☆10Dec 12, 2022Updated 3 years ago
kyegomez / BRAVE-ViT-Swarm
View on GitHub
Implementation of the paper: "BRAVE : Broadening the visual encoding of vision-language models"
☆26Jun 22, 2026Updated last month
neuhai / FairytaleQA_Dataset
View on GitHub
The backup repository for FairytaleQA dataset and paper "Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset f…
☆10May 30, 2023Updated 3 years ago
YZHJessica / CDVQA
View on GitHub
☆14Feb 17, 2023Updated 3 years ago
LX-doctorAI1 / DeltaNet
View on GitHub
☆18Nov 11, 2022Updated 3 years ago
yihong-97 / STICT
View on GitHub
Code and Dataset for our CVPR 2022 paper "Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training"
☆12Jul 8, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
princetonvisualai / pointingqa
View on GitHub
Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"
☆19Oct 4, 2022Updated 3 years ago
PhoebusSi / Thinking-while-Observing
View on GitHub
Code for our ACL-2023 paper: "Combo of Thinking and Observing for Outside-Knowledge VQA"
☆12Jun 30, 2023Updated 3 years ago
liuxy1103 / CRAC
View on GitHub
☆13Jan 12, 2024Updated 2 years ago
longbai1006 / Surgical-VQLAPlus
View on GitHub
Official Implementation of "Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question Localized-Answering i…
☆15May 6, 2025Updated last year
sangminwoo / Temporal-Span-Proposal-Network-VidVRD
View on GitHub
[ESWA 2025] Official pytorch implementation of "What and When to look?: Temporal Span Proposal Network for Video Relation Detection"
☆16Aug 9, 2021Updated 4 years ago
LivXue / VCIN
View on GitHub
Authors's code for "Variational Causal Inference Network for Explanatory Visual Question Answering" and "Integrating Neural-Symbolic Reas…
☆13Apr 13, 2026Updated 3 months ago
YulongBonjour / BrainCLIP
View on GitHub
Coming soon~
☆14Jul 15, 2025Updated last year
lingeringlight / SETA
View on GitHub
The official implementation for SETA (TIP 2024).
☆12Feb 17, 2025Updated last year
NLP-Discourse-SoochowU / GAN_DP
View on GitHub
Longyin Zhang, Fang Kong, and Guodong Zhou. Adversarial Learning for Discourse Rhetorical Structure Parsing. Accepted by ACL-IJCNLP2021.
☆19Jan 12, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
AgarwalVedika / CausalVQA
View on GitHub
☆12Jun 17, 2020Updated 6 years ago
bollossom / VTSNN
View on GitHub
Public code for VTSNN: A Virtual Temporal Spiking Neural Network (Fron. Neur.)
☆53Aug 27, 2023Updated 2 years ago
luciusssss / why-learn-shortcut
View on GitHub
[ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?
☆16Aug 8, 2023Updated 2 years ago
guoyang9 / UnifER
View on GitHub
Official implementation for the MM'22 paper.
☆14Jun 30, 2022Updated 4 years ago
2-mo / Awesome-Thinking-with-VAD
View on GitHub
☆17May 26, 2026Updated 2 months ago
liuxy1103 / BISSG
View on GitHub
code for paper IJCAI2022
☆13Jul 2, 2024Updated 2 years ago
wapping / annotated-transformer
View on GitHub
An annotated transformer.
☆13Jul 11, 2021Updated 5 years ago