songrise / MoPE
Official Implementation for MoPE: Parameter-Efficient and Scalable Multimodal Fusion via Mixture of Prompt Experts
☆22Updated last month
Alternatives and similar repositories for MoPE
Users that are interested in MoPE are comparing it to the libraries listed below
- ☆71Updated 4 months ago
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆68Updated last month
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆83Updated last year
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆32Updated last year
- ☆82Updated 10 months ago
- [AAAI2024] Official implementation of TGP-T☆28Updated last year
- [AAAI 2025] Enhance Vision-Language Alignment with Noise☆24Updated 8 months ago
- [CVPR 2024] Official implementation of the paper "DePT: Decoupled Prompt Tuning"☆107Updated 3 months ago
- cliptrase☆45Updated last year
- [CVPR 2025] Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation☆14Updated last month
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]☆44Updated 5 months ago
- The official implementation of RAR☆91Updated last year
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆74Updated 7 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆45Updated last year
- The official PyTorch implementation of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆79Updated 4 months ago
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation☆76Updated last year
- [Paper][AAAI2024] Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆148Updated last year
- [ICCV 2025] Official PyTorch Code for "Advancing Textual Prompt Learning with Anchored Attributes"☆89Updated last week
- CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts☆54Updated last year
- Easy wrapper for inserting LoRA layers in CLIP.☆36Updated last year
- [NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models☆70Updated 3 months ago
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models☆53Updated last year
- ☆136Updated last year
- [ICML 2024] Official implementation for "Predictive Dynamic Fusion."☆63Updated 7 months ago
- [NeurIPS2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models"☆192Updated last month
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆58Updated last month
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆42Updated 10 months ago
- ☆19Updated 3 months ago
- [CVPR 2024] This is the official implementation of "MART: Masked Affective RepresenTation Learning via Masked Temporal Distribution Disti…☆18Updated 2 months ago
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs☆20Updated 3 months ago