HCPLab-SYSU / CausalVLR
CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning (an open-source framework for visual-linguistic causal reasoning)
☆152 · Updated last month
Alternatives and similar repositories for CausalVLR:
Users interested in CausalVLR also compare it to the repositories listed below.
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering ☆71 · Updated last year
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering ☆18 · Updated last year
- [BMVC 2023] Spatial and Planar Consistency for Semi-Supervised Volumetric Medical Image Segmentation ☆76 · Updated 7 months ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding" (CVPR 2025 Highlight) ☆20 · Updated last week
- "Towards Semi-supervised Learning with Non-random Missing Labels" by Yue Duan (ICCV 2023) ☆77 · Updated 5 months ago
- Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval (ICCV 2023 Oral) ☆91 · Updated last year
- Transferable Decoding with Visual Entities for Zero-Shot Image Captioning (ICCV 2023) ☆152 · Updated 7 months ago
- Code release for "On-the-fly Category Discovery" (CVPR 2023) ☆52 · Updated last year
- Towards Better Stability and Adaptability: Improve Online Self-Training for Model Adaptation in Semantic Segmentation (CVPR 2023) ☆79 · Updated last year
- ☆53 · Updated 11 months ago
- PyTorch implementation of "Test-time Adaption against Multi-modal Reliability Bias" ☆35 · Updated 4 months ago
- Balanced Classification: A Unified Framework for Long-Tailed Object Detection (TMM 2023) ☆97 · Updated 2 weeks ago
- Accepted by ICCV 2023: Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-bas… ☆100 · Updated last year
- [MICCAI 2023] DiffMIC: Dual-Guidance Diffusion Network for Medical Image Classification ☆157 · Updated last year
- MomentDiff: Generative Video Moment Retrieval from Random to Real (NeurIPS 2023) ☆78 · Updated last year
- ☆86 · Updated 2 years ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024) ☆71 · Updated 3 months ago
- [Paper] [AAAI 2024] Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations ☆137 · Updated 10 months ago
- ☆87 · Updated 10 months ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning" ☆81 · Updated last year
- [CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning ☆74 · Updated last month
- "MutexMatch: Semi-Supervised Learning with Mutex-Based Consistency Regularization" by Yue Duan (TNNLS) ☆71 · Updated 5 months ago
- ☆13 · Updated last year
- ✨ A curated list of papers on uncertainty in multi-modal large language models (MLLMs) ☆44 · Updated last month
- Code for an IJCAI 2022 paper ☆12 · Updated 10 months ago
- [CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models" ☆13 · Updated 7 months ago
- [ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation" ☆77 · Updated 2 years ago
- Panoptic Scene Graph Biased Annotation ☆35 · Updated 10 months ago
- ☆132 · Updated 10 months ago
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation" (CVPR 2024) ☆51 · Updated 6 months ago