JulietChoo / VisionSelectorLinks
VisionSelector: End-to-End Learnable Visual Token Compression for Efficient Multimodal LLMs
☆53Updated 3 months ago
Alternatives and similar repositories for VisionSelector
Users that are interested in VisionSelector are comparing it to the libraries listed below
Sorting:
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆233Updated last year
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model☆37Updated last year
- Code release for VTW (AAAI 2025 Oral)☆64Updated 2 months ago
- ☆125Updated last year
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆69Updated 6 months ago
- ☆152Updated last year
- ☆28Updated last year
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆52Updated last year
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.☆80Updated 3 months ago
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆68Updated 4 months ago
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆168Updated 7 months ago
- One-shot Entropy Minimization☆188Updated 7 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆60Updated 6 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆185Updated 4 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆152Updated 6 months ago
- ☆64Updated last week
- [ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning☆77Updated 2 months ago
- ☆43Updated last year
- [ICLR 2025 Oral🔥] SD-LoRA: Scalable Decoupled Low-Rank Adaptation for Class Incremental Learning☆75Updated 7 months ago
- [CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models☆65Updated 2 months ago
- toy reproduction of Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts☆28Updated last year
- Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.☆104Updated 7 months ago
- ☆56Updated last year
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆78Updated 4 months ago
- MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer☆49Updated last year
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆102Updated 3 weeks ago
- ☆37Updated 5 months ago
- MokA: Multimodal Low-Rank Adaptation for MLLMs☆73Updated last month
- Awesome Low-Rank Adaptation☆59Updated 5 months ago
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp…☆34Updated last week