[IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering
☆20Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for CMCIR
Users that are interested in CMCIR are comparing it to the libraries listed below
Sorting:
- Transferable Feature Representation for Visible-to-Infrared Cross-Dataset Human Action Recognition (Complexity 2018)☆13Dec 14, 2022Updated 3 years ago
- VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis☆13Dec 26, 2024Updated last year
- [IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition☆29Jan 6, 2025Updated last year
- ☆13Oct 23, 2023Updated 2 years ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"☆43Apr 27, 2025Updated 10 months ago
- The official repository of [CVPR2025] DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering☆25Apr 18, 2025Updated 10 months ago
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆78Jul 6, 2023Updated 2 years ago
- [AAAI 2026] Causal-Tune: Mining Causal Factors from Vision Foundation Models for Domain Generalized Semantic Segmentation☆24Dec 28, 2025Updated 2 months ago
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer☆37Oct 18, 2023Updated 2 years ago
- RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering☆10Nov 27, 2022Updated 3 years ago
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Mar 17, 2022Updated 3 years ago
- The official implementation for SETA (TIP 2024).☆11Feb 17, 2025Updated last year
- [AAAI 2026] Official repository of the EMAformer paper: "EMAformer: Enhancing Transformer through Embedding Armor for Time Series Forecas…☆35Dec 3, 2025Updated 3 months ago
- This repository contains the python scripts developed as a part of the work presented in the paper "Low-latency auditory spatial attentio…☆10Sep 15, 2021Updated 4 years ago
- ☆40Nov 29, 2022Updated 3 years ago
- SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training☆84Aug 2, 2023Updated 2 years ago
- Semantic Connectivity-Driven Pseudo-labeling for Cross-domain Segmentation☆16Dec 12, 2023Updated 2 years ago
- (AAAI2026) Open-World Deepfake Attribution via Confidence-Aware Asymmetric Learning (CAL)☆28Jan 1, 2026Updated 2 months ago
- 基于 React + router + redux + axios 和 Flask + MySQL + Pytorch 的视觉问答管理系统☆10Dec 12, 2022Updated 3 years ago
- LMM for VQA, tcsvt version☆11Jul 19, 2024Updated last year
- This repository contains the python scripts developed as a part of the work presented in the paper "STAnet: A Spatiotemporal Attention Ne…☆15May 10, 2023Updated 2 years ago
- ☆11Jul 11, 2023Updated 2 years ago
- An annotated transformer.☆13Jul 11, 2021Updated 4 years ago
- Code and Dataset for our CVPR 2022 paper "Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training"☆12Jul 8, 2022Updated 3 years ago
- Transformer Doctor: Diagnosing and Treating Vision Transformers☆11Jan 15, 2025Updated last year
- Baseline for REVERIE-Challenge using HOP☆10Jul 4, 2022Updated 3 years ago
- Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)☆48Nov 3, 2022Updated 3 years ago
- CLUE code☆14May 1, 2025Updated 10 months ago
- Codes for coreference-aware machine reading comprehension☆13Mar 13, 2022Updated 3 years ago
- 2016 OpenMIIR Representation Learning Experiment☆10Feb 2, 2017Updated 9 years ago
- Code and data for Distributional Correlation–Aware Knowledge Distillation for Stock Trading Volume Prediction (ECML-PKDD 22)☆15Sep 6, 2022Updated 3 years ago
- [NeurIPS 2025] Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation☆25Oct 30, 2025Updated 4 months ago
- ☆15Aug 12, 2022Updated 3 years ago
- ☆14Sep 1, 2023Updated 2 years ago
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…☆12Sep 16, 2022Updated 3 years ago
- Public code for VTSNN: A Virtual Temporal Spiking Neural Network (Fron. Neur.)☆53Aug 27, 2023Updated 2 years ago
- Preliminary code for reviewers☆13Mar 30, 2021Updated 4 years ago
- SEEG Project☆16Dec 14, 2020Updated 5 years ago
- Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers [https://arxiv.org/pdf/2112.04934.pdf]☆15May 13, 2023Updated 2 years ago