[ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning
☆50 · May 12, 2024 · Updated last year
Alternatives and similar repositories for MemVP
Users that are interested in MemVP are comparing it to the libraries listed below
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model ☆90 · Nov 28, 2023 · Updated 2 years ago
- [ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control" ☆53 · Sep 21, 2023 · Updated 2 years ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models ☆22 · Jul 10, 2025 · Updated 7 months ago
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models ☆21 · Jan 29, 2025 · Updated last year
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". … ☆64 · Nov 5, 2024 · Updated last year
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass ☆198 · Aug 1, 2023 · Updated 2 years ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning" ☆86 · Mar 21, 2024 · Updated last year
- ☆61 · May 2, 2025 · Updated 10 months ago
- ☆47 · Nov 8, 2024 · Updated last year
- (IJCV 2023) Official implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels" ☆13 · Mar 20, 2025 · Updated 11 months ago
- ☆11 · Jan 19, 2025 · Updated last year
- 【NeurIPS 2024】Dense Connector for MLLMs ☆181 · Oct 14, 2024 · Updated last year
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by … ☆77 · Jan 27, 2024 · Updated 2 years ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆15 · Feb 9, 2026 · Updated 3 weeks ago
- [ECCV 2022] Contrastive Prototypical Network with Wasserstein Confidence Penalty ☆11 · Oct 20, 2022 · Updated 3 years ago
- code used on the paper Face Reconstruction with Variational Autoencoder and Face Masks https://arxiv.org/abs/2112.02139 ☆12 · Jul 23, 2024 · Updated last year
- Collection of papers about video-audio understanding ☆22 · Dec 26, 2025 · Updated 2 months ago
- The efficient tuning method for VLMs ☆80 · Mar 10, 2024 · Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆25 · Nov 23, 2024 · Updated last year
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching" ☆37 · Aug 18, 2024 · Updated last year
- ☆15 · Nov 7, 2024 · Updated last year
- Log-Polar Space Convolution for Convolutional Neural Networks ☆13 · Dec 12, 2022 · Updated 3 years ago
- [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation ☆64 · Feb 20, 2026 · Updated 2 weeks ago
- FFNet: MetaMixer-based Efficient Convolutional Mixer Design ☆31 · Mar 11, 2025 · Updated 11 months ago
- Official Implementation of Convolutional Normalization: Improving Robustness and Training for Deep Neural Networks ☆30 · Apr 13, 2022 · Updated 3 years ago
- A Python library for processing and filtering TabLib ☆13 · Aug 24, 2024 · Updated last year
- ☆16 · Mar 24, 2025 · Updated 11 months ago
- ☆18 · Mar 19, 2025 · Updated 11 months ago
- LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections (NeurIPS 2023) ☆29 · Dec 27, 2023 · Updated 2 years ago
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024 ☆33 · Jan 3, 2024 · Updated 2 years ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812) ☆35 · Mar 7, 2025 · Updated 11 months ago
- Contrastive Model Adaptation for Cross-Condition Robustness in Semantic Segmentation [ICCV 2023] ☆40 · Aug 30, 2023 · Updated 2 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval ☆16 · Jul 6, 2024 · Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆19 · Mar 10, 2025 · Updated 11 months ago
- ☆17 · May 31, 2023 · Updated 2 years ago
- [CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts ☆336 · Jul 17, 2024 · Updated last year
- The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation" ☆22 · Apr 22, 2025 · Updated 10 months ago
- source code of (quasi-)Givens Orthogonal Fine Tuning integrated to peft lib ☆17 · Mar 13, 2025 · Updated 11 months ago
- ☆20 · Apr 16, 2025 · Updated 10 months ago