ZLKong / awesome-token-reduction

A collection of recent token reduction (token pruning, merging, clustering, etc.) techniques for ML/AI

☆27

Alternatives and similar repositories for awesome-token-reduction:

Users who are interested in awesome-token-reduction are comparing it to the repositories listed below:

xuyang-liu16 / Awesome-Token-Reduction-for-Model-Compression
📚 Collection of resources on token reduction for model compression.
☆47 Updated last week
Gumpest / SparseVLMs
Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".
☆83 Updated 3 weeks ago
lzhxmu / VTW
Code release for VTW (AAAI 2025) Oral
☆33 Updated 2 months ago
JinXins / Awesome-Token-Merge-for-MLLMs
A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.
☆44 Updated 2 months ago
ywh187 / FitPrune
☆41 Updated 2 months ago
ZichenWen1 / DART
Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"
☆26 Updated this week
Theia-4869 / FasterVLM
Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.
☆59 Updated 3 months ago
42Shawn / LLaVA-PruMerge
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
☆121 Updated 10 months ago
LINs-lab / DynMoE
[ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
☆80 Updated last month
Cooperx521 / PyramidDrop
(CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
☆81 Updated 3 weeks ago
SUSTechBruce / LOOK-M
[EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…
☆92 Updated 4 months ago
Purshow / Awesome-LVLM-Hallucination
☆45 Updated 4 months ago
Yaxin9Luo / Gamma-MOD
[ICLR 2025] γ-MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models
☆33 Updated last month
Osilly / dynamic_llava
[ICLR 2025] The official PyTorch implementation of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…
☆26 Updated 4 months ago
Purshow / Awesome-Unified-Multimodal
☆124 Updated this week
wutaiqiang / MoSLoRA
☆99 Updated 8 months ago
HKUST-LongGroup / Awesome-MLLM-Benchmarks
☆107 Updated last month
NUS-HPC-AI-Lab / Dynamic-Tuning
[NeurIPS 2024] The official implementation of "Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"
☆43 Updated 3 months ago
KD-TAO / DyCoke
DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models
☆38 Updated last week
Open-DataFlow / Awesome_MLLMs_Reasoning
☆74 Updated last week
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆56 Updated last month
iboing / CorDA
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning (NeurIPS 2024)
☆45 Updated 2 months ago
MAC-AutoML / QuoTA
This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehens…
☆64 Updated 2 weeks ago
NUS-HPC-AI-Lab / DD-Ranking
Data distillation benchmark
☆58 Updated this week
ChangyuanWang17 / QVLM
[NeurIPS'24] Efficient and accurate memory-saving method for W4A4 large multi-modal models.
☆68 Updated 2 months ago
saccharomycetes / mllms_know
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
☆97 Updated last week
1zhou-Wang / MemVR
Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal …
☆46 Updated last month
yu-rp / apiprompting
[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models
☆79 Updated 5 months ago
FeipengMa6 / VLoRA
[NeurIPS 2024] Visual Perception by Large Language Model’s Weights
☆41 Updated 5 months ago
daixiangzi / Awesome-Token-Compress
A paper list of recent works on token compression for ViT and VLM
☆391 Updated this week