songrise / MoPE
Official Implementation for MoPE: Parameter-Efficient and Scalable Multimodal Fusion via Mixture of Prompt
☆17Updated last month
Alternatives and similar repositories for MoPE:
Users that are interested in MoPE are comparing it to the libraries listed below
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation☆70Updated 8 months ago
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆32Updated last year
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆43Updated 2 weeks ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆81Updated last year
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆68Updated 2 months ago
- [AAAI2024] Official implementation of TGP-T☆28Updated last year
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆136Updated 9 months ago
- [ICLR 2025] Official Implementation of Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection☆20Updated this week
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆32Updated 3 weeks ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆70Updated 9 months ago
- ☆32Updated last year
- [NeurIPS 2024] Visual Perception by Large Language Model’s Weights☆43Updated 2 weeks ago
- Official implementation of ResCLIP: Residual Attention for Training-free Dense Vision-language Inference☆29Updated last month
- ☆46Updated last year
- CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts☆49Updated 7 months ago
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning☆45Updated 8 months ago
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆80Updated last year
- [CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"☆32Updated last week
- The official implementation of the paper "DIP: Dual Incongruity Perceiving Network for Sarcasm Detection"☆32Updated 4 months ago
- Multimodal-Composite-Editing-and-Retrieval-update☆32Updated 5 months ago
- ☆35Updated 2 years ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆67Updated 6 months ago
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆33Updated 9 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆39Updated 9 months ago
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆97Updated 4 months ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆18Updated 2 months ago
- The official implementation of RAR☆85Updated last year
- cliptrase☆35Updated 7 months ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"☆17Updated last week
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆86Updated last year