JieShibo / MemVP
[ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning
☆50May 12, 2024Updated last year
Alternatives and similar repositories for MemVP
Users who are interested in MemVP are comparing it to the repositories listed below
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆90Nov 28, 2023Updated 2 years ago
- [ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"☆53Sep 21, 2023Updated 2 years ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆22Jul 10, 2025Updated 7 months ago
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆62Nov 5, 2024Updated last year
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass☆198Aug 1, 2023Updated 2 years ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆86Mar 21, 2024Updated last year
- ☆46Nov 8, 2024Updated last year
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Updated this week
- (IJCV 2023) Official implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels"☆13Mar 20, 2025Updated 10 months ago
- 【NeurIPS 2024】Dense Connector for MLLMs☆180Oct 14, 2024Updated last year
- ☆11Jan 19, 2025Updated last year
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆77Jan 27, 2024Updated 2 years ago
- Collection of papers about video-audio understanding☆22Dec 26, 2025Updated last month
- The efficient tuning method for VLMs☆80Mar 10, 2024Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆37Aug 18, 2024Updated last year
- ☆15Nov 7, 2024Updated last year
- [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation☆64Jul 10, 2025Updated 7 months ago
- FFNet: MetaMixer-based Efficient Convolutional Mixer Design☆31Mar 11, 2025Updated 11 months ago
- Official Implementation of Convolutional Normalization: Improving Robustness and Training for Deep Neural Networks☆30Apr 13, 2022Updated 3 years ago
- A Python library for processing and filtering TabLib☆13Aug 24, 2024Updated last year
- ☆16Mar 24, 2025Updated 10 months ago
- ☆18Mar 19, 2025Updated 10 months ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆33Jan 3, 2024Updated 2 years ago
- LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections (NeurIPS 2023)☆29Dec 27, 2023Updated 2 years ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Mar 7, 2025Updated 11 months ago
- Contrastive Model Adaptation for Cross-Condition Robustness in Semantic Segmentation [ICCV 2023]☆40Aug 30, 2023Updated 2 years ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated 11 months ago
- ☆17May 31, 2023Updated 2 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- [CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts☆336Jul 17, 2024Updated last year
- ☆20Apr 16, 2025Updated 9 months ago
- The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"☆22Apr 22, 2025Updated 9 months ago
- source code of (quasi-)Givens Orthogonal Fine Tuning integrated to peft lib☆18Mar 13, 2025Updated 11 months ago
- LMM solved catastrophic forgetting, AAAI2025☆45Apr 15, 2025Updated 9 months ago
- LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models☆163Sep 27, 2025Updated 4 months ago
- ☆36Aug 27, 2025Updated 5 months ago
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…☆25Jun 4, 2025Updated 8 months ago