songrise / MoPELinks
Official Implementation for MoPE (T-MM 2025)
☆24Updated 2 months ago
Alternatives and similar repositories for MoPE
Users that are interested in MoPE are comparing it to the libraries listed below
Sorting:
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆33Updated last year
- ☆76Updated 7 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆73Updated 10 months ago
- This repo is the official implementation of "Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal Learning" accepted by AA…☆55Updated last week
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆85Updated last year
- [AAAI 2025] Enhance Vision-Language Alignment with Noise☆25Updated 11 months ago
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆76Updated 3 weeks ago
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆109Updated 3 weeks ago
- The official implementation of the paper "DIP: Dual Incongruity Perceiving Network for Sarcasm Detection"☆36Updated last year
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]☆47Updated 9 months ago
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆153Updated last year
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation☆76Updated last year
- The official implementation of RAR☆93Updated this week
- ☆36Updated 2 years ago
- [AAAI2024] Official implementation of TGP-T☆31Updated last year
- Official Implementation of the ECCV 2024 Paper: "CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts"☆53Updated last month
- [CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge☆154Updated 3 months ago
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆33Updated last month
- [CVPR 2025] Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation☆25Updated 4 months ago
- The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆92Updated 7 months ago
- [CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension☆60Updated last year
- ☆83Updated last year
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆88Updated 2 years ago
- ☆51Updated 2 years ago
- A comprehensive survey of Composed Multi-modal Retrieval (CMR), including Composed Image Retrieval (CIR) and Composed Video Retrieval (CV…☆74Updated 3 months ago
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆76Updated 2 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆50Updated last year
- Official PyTorch Code for Anchor Token Guided Prompt Learning Methods: [ICCV 2025] ATPrompt and [Arxiv 2511.21188] AnchorOPT☆115Updated last week
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆62Updated 5 months ago
- [AAAI 2024] TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training☆105Updated last year