VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis
☆13Dec 26, 2024Updated last year
Alternatives and similar repositories for VisionGRU
Users that are interested in VisionGRU are comparing it to the libraries listed below
Sorting:
- ☆13Oct 23, 2023Updated 2 years ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"☆45Apr 27, 2025Updated 10 months ago
- Transferable Feature Representation for Visible-to-Infrared Cross-Dataset Human Action Recognition (Complexity 2018)☆13Dec 14, 2022Updated 3 years ago
- [IEEE T-CSVT 2019] Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition☆14Nov 26, 2019Updated 6 years ago
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆20Jul 6, 2023Updated 2 years ago
- The official repository of [CVPR2025] DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering☆25Apr 18, 2025Updated 11 months ago
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning☆24Dec 19, 2023Updated 2 years ago
- [IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition☆29Jan 6, 2025Updated last year
- Embodied Question Answering (EQA) benchmark and method (ICCV 2025)☆48Aug 12, 2025Updated 7 months ago
- 【CVPR 2026 Finding】Official Repo for Paper ‘’Heartcare Suite: A Unified Multimodal ECG Suite for Dual Signal-Image Modeling and Understan…☆29Feb 24, 2026Updated 3 weeks ago