MCR-PEFT / C-MCR
☆38Updated last year
Alternatives and similar repositories for C-MCR:
Users that are interested in C-MCR are comparing it to the libraries listed below
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆80Updated 11 months ago
- [ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation☆42Updated last year
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024☆47Updated 3 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆38Updated 7 months ago
- ☆29Updated 7 months ago
- [ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning☆45Updated 9 months ago
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning☆43Updated 6 months ago
- Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)☆55Updated 10 months ago
- A python implement for Certifiable Robust Multi-modal Training☆17Updated 6 months ago
- This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat…☆76Updated 10 months ago
- Visual self-questioning for large vision-language assistant.☆40Updated 4 months ago
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆15Updated last year
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆85Updated last year
- The efficient tuning method for VLMs☆77Updated 11 months ago
- [CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.☆26Updated 9 months ago
- ☆41Updated last year
- [CVPR' 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆45Updated 6 months ago
- ☆24Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆38Updated last year
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆131Updated 8 months ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆19Updated 5 months ago
- Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…☆16Updated last week
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Updated 2 years ago
- The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆51Updated 3 weeks ago
- ☆62Updated last year
- ☆89Updated last year
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆67Updated 2 weeks ago
- [CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning☆65Updated last month
- [NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models☆45Updated 2 months ago
- [ICCV2023] The repo for "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation".☆24Updated last year