YangLiu9208 / CMCIR
[IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for CMCIR
- Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval --ICCV2023 Oral☆90Updated last year
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆72Updated last year
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023☆75Updated last year
- Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023☆147Updated 2 months ago
- Accepted by ICCV2023, Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-bas…☆103Updated 7 months ago
- ☆15Updated last year
- [ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"☆79Updated 2 years ago
- CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning (视觉-语言因果推理开源框架)☆134Updated 7 months ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆12Updated 2 months ago
- ☆19Updated 7 months ago
- An official implementation for MS-DETR in ACL'23☆16Updated last year
- Balanced Classification: A Unified Framework for Long-Tailed Object Detection (TMM 2023)☆95Updated last year
- ☆84Updated last year
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)☆29Updated 7 months ago
- ☆33Updated last year
- The implementaion of CoDT on the task of NTU-60+->PKUMMD☆74Updated last year
- Code for ECCV 2022 paper "Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding"☆29Updated last year
- Source code of our CVPR2024 paper TeachCLIP for Text-to-Video Retrieval☆22Updated 3 weeks ago
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆91Updated last year
- [NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding☆45Updated 8 months ago
- A lightweight codebase for referring expression comprehension and segmentation☆52Updated 2 years ago
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆14Updated 3 weeks ago
- Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)☆17Updated 8 months ago
- Video Graph Transformer for Video Question Answering (ECCV'22)☆45Updated last year
- [TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”☆29Updated 7 months ago
- ☆11Updated 11 months ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆59Updated 4 months ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆131Updated 3 weeks ago
- Pytorch Implementation of ECCV'22 paper: Video Activity Localisation with Uncertainties in Temporal Boundary☆15Updated 2 years ago
- ☆29Updated 7 months ago