JulietChoo / VisionSelector
VisionSelector: End-to-End Learnable Visual Token Compression for Efficient Multimodal LLMs
☆52 Updated 3 months ago
Alternatives and similar repositories for VisionSelector
Users who are interested in VisionSelector are comparing it to the repositories listed below.
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning ☆233 Updated last year
- Code release for VTW (AAAI 2025 Oral) ☆64 Updated 3 months ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24) ☆69 Updated 7 months ago
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model ☆37 Updated last year
- Awesome Low-Rank Adaptation ☆59 Updated 6 months ago
- [ICLR 2025 Oral🔥] SD-LoRA: Scalable Decoupled Low-Rank Adaptation for Class Incremental Learning ☆76 Updated 7 months ago
- ☆125 Updated last year
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning. ☆169 Updated last week
- ☆152 Updated last year
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont… ☆70 Updated 4 months ago
- ☆28 Updated last year
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp… ☆36 Updated 2 weeks ago
- One-shot Entropy Minimization ☆188 Updated 7 months ago
- MokA: Multimodal Low-Rank Adaptation for MLLMs ☆80 Updated last month
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in… ☆171 Updated 4 months ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs. ☆84 Updated 3 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought ☆106 Updated last month
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method ☆203 Updated last year
- toy reproduction of Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts ☆28 Updated last year
- Awesome-Low-Rank-Adaptation ☆128 Updated last year
- ☆64 Updated 2 weeks ago
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning ☆78 Updated 4 months ago
- [EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More" ☆103 Updated 3 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality ☆60 Updated 7 months ago
- [CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models ☆65 Updated 2 months ago
- Multimodal Large Language Model (MLLM) Tuning Survey: Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model ☆94 Updated 6 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key ☆104 Updated last month
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation" ☆52 Updated last year
- ☆56 Updated last year
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379) ☆45 Updated 7 months ago