stoneMo / CIGN
Official implementation for CIGN
☆14Updated last year
Related projects ⓘ
Alternatives and complementary repositories for CIGN
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆11Updated 3 weeks ago
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"☆25Updated 3 weeks ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆11Updated 8 months ago
- ☆10Updated 4 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆34Updated 4 months ago
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…☆34Updated 3 months ago
- A python implement for Certifiable Robust Multi-modal Training☆13Updated 3 months ago
- ☆12Updated 11 months ago
- ☆16Updated last year
- NeurIPS'2023 official implementation code☆56Updated 11 months ago
- The official repository for ECCV2024 paper "PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery"☆13Updated 2 months ago
- ☆21Updated last year
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024☆25Updated 3 weeks ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆37Updated 10 months ago
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer☆55Updated 7 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆32Updated 4 months ago
- [NeurIPS 2024] Code for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models☆22Updated 3 weeks ago
- EPCFormer: Expression Prompt Collaboration Transformer for Universal Referring Video Object Segmentation☆9Updated last year
- [CVPR 2024] TEA: Test-time Energy Adaptation☆51Updated 8 months ago
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)☆33Updated 3 months ago
- ☆32Updated 11 months ago
- ☆26Updated last year
- [CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.☆22Updated 5 months ago
- ☆13Updated 4 months ago
- Official repository of "Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer", AAAI 2024☆15Updated 7 months ago
- Multimodal Learning Method MLA for CVPR 2024☆56Updated 4 months ago
- [ICCV2023] Borrowing Knowledge From Pre-trained Language Model: A New Data-efficient Visual Learning Paradigm☆15Updated last year
- [ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation☆39Updated last year
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024☆39Updated this week
- Accepted at ICCV '23☆13Updated last year