JulietChoo / VisionSelectorLinks
VisionSelector: End-to-End Learnable Visual Token Compression for Efficient Multimodal LLMs
☆48Updated 2 months ago
Alternatives and similar repositories for VisionSelector
Users that are interested in VisionSelector are comparing it to the libraries listed below
Sorting:
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model☆36Updated last year
- Code release for VTW (AAAI 2025 Oral)☆65Updated 2 months ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆69Updated 6 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆234Updated last year
- MokA: Multimodal Low-Rank Adaptation for MLLMs☆62Updated last week
- ☆125Updated last year
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆153Updated 6 months ago
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆67Updated 3 months ago
- [CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models☆63Updated last month
- ☆152Updated last year
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆52Updated last year
- 📚 Collection of token-level model compression resources.☆189Updated 4 months ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.☆80Updated 2 months ago
- Multimodal Large Language Model (MLLM) Tuning Survey: Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model☆91Updated 5 months ago
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆164Updated 6 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆180Updated 3 months ago
- Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.☆101Updated 6 months ago
- [ICLR 2025 Oral🔥] SD-LoRA: Scalable Decoupled Low-Rank Adaptation for Class Incremental Learning☆75Updated 6 months ago
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆104Updated last year
- [ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation☆218Updated 9 months ago
- Awesome Low-Rank Adaptation☆59Updated 5 months ago
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" and "Sp…☆217Updated 2 weeks ago
- ☆62Updated 8 months ago
- Agentic MLLMs☆133Updated 2 months ago
- One-shot Entropy Minimization☆187Updated 6 months ago
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆78Updated 3 months ago
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning(NeurIPS 2024)☆53Updated 11 months ago
- [TMLR 2025] Efficient Reasoning Models: A Survey☆290Updated last week
- [ICML 2024] Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models☆35Updated last year
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆97Updated last month