songrise / MoPELinks
Official Implementation for MoPE (T-MM 2025)
☆25Updated 3 months ago
Alternatives and similar repositories for MoPE
Users that are interested in MoPE are comparing it to the libraries listed below
Sorting:
- ☆81Updated 8 months ago
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆33Updated 2 years ago
- ☆52Updated 2 years ago
- [AAAI2024] Official implementation of TGP-T☆32Updated last year
- ☆27Updated 2 years ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆73Updated 11 months ago
- ☆31Updated last year
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆109Updated last month
- Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".☆130Updated last year
- The official implementation of the paper "DIP: Dual Incongruity Perceiving Network for Sarcasm Detection"☆36Updated last year
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆85Updated last year
- The official repos of "Knowledge Bridger: Towards Training-Free Missing Modality Completion"☆19Updated 6 months ago
- Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23☆226Updated 2 years ago
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation☆77Updated last year
- ☆60Updated 6 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆50Updated last year
- Easy wrapper for inserting LoRA layers in CLIP.☆40Updated last year
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆92Updated last year
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"☆57Updated last year
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆281Updated 2 years ago
- The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆95Updated 8 months ago
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆78Updated 3 months ago
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆78Updated last month
- [ToMM2023] - AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval☆20Updated last year
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆152Updated last year
- [ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation☆46Updated 2 years ago
- ☆83Updated last year
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆62Updated 6 months ago
- [ICML 2024] Official implementation for "Predictive Dynamic Fusion."☆69Updated 11 months ago
- [CVPR 2025] Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation☆25Updated 5 months ago