leo-yangli / VB-LoRA
This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).
☆24Updated last month
Related projects
Alternatives and complementary repositories for VB-LoRA
- Adapting LLaMA Decoder to Vision Transformer☆27Updated 6 months ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆28Updated last month
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆17Updated last month
- Code for the ICML 2023 paper "Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation"☆35Updated last year
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆14Updated last month
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆39Updated 2 weeks ago
- Official code for the ICLR 2024 paper "Do Generated Data Always Help Contrastive Learning?"☆28Updated 7 months ago
- [CVPR 2024 Highlight] ImageNet-D☆38Updated last month
- Implementation of "Breaking the Low-Rank Dilemma of Linear Attention"☆12Updated this week
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning☆36Updated 3 months ago
- The official code of the paper "PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction".☆44Updated 3 weeks ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆30Updated 5 months ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆26Updated last month
- CAD - Memory Efficient Convolutional Adapter for Segment Anything☆11Updated last month
- Official repo of γ-MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆18Updated 3 weeks ago
- HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Vision-Language M…☆13Updated 2 months ago
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆83Updated 11 months ago
- Official PyTorch implementation of Self-emerging Token Labeling☆30Updated 7 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆70Updated 3 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆26Updated 5 months ago
- GIFT: Generative Interpretable Fine-Tuning☆18Updated last month
- Making LLaVA Tiny via MoE-Knowledge Distillation☆60Updated 3 weeks ago