Theia-4869/CDPruner

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Theia-4869/CDPruner)

Theia-4869 / CDPruner

[NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.

☆105

Alternatives and similar repositories for CDPruner

Users that are interested in CDPruner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

vbdi / divprune
View on GitHub
[CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models
☆86Apr 16, 2026Updated 3 months ago
Theia-4869 / VisPruner
View on GitHub
[ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs
☆84Jul 1, 2025Updated last year
ZichenWen1 / DART
View on GitHub
[EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"
☆121Oct 12, 2025Updated 9 months ago
Gumpest / SparseVLMs
View on GitHub
[ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".
☆267Dec 22, 2025Updated 7 months ago
Danielement321 / HiPrune
View on GitHub
[ACL-2026 Findings] Implementation for HiPrune, a training-free visual token pruning method for VLM acceleration.
☆60Apr 29, 2026Updated 2 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
obananas / HoloV
View on GitHub
[NeurIPS 2025 🔥] Official implementation for "Don't Just Chase “Highlighted Tokens” in MLLMs: Revisiting Visual Holistic Context Retenti…
☆66Mar 5, 2026Updated 4 months ago
JIA-Lab-research / VisionZip
View on GitHub
Official repository for VisionZip (CVPR 2025)
☆443Jul 21, 2025Updated last year
ZhengyaoFang / PruneSID
View on GitHub
Official code for **Prune Redundancy, Preserve Essence: Vision Token Compression in VLMs via Synergistic Importance-Diversity** (PruneSI…
☆14Mar 25, 2026Updated 4 months ago
KD-TAO / DyCoke
View on GitHub
[CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models
☆113Nov 22, 2025Updated 8 months ago
HVision-NKU / GlimpsePrune
View on GitHub
[TCSVT] Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"
☆99Jun 12, 2026Updated last month
LunarShen / FastVID
View on GitHub
[NeurIPS 2025] FastVID: Dynamic Density Pruning for Fast Video Large Language Models
☆37Nov 10, 2025Updated 8 months ago
cokeshao / HoliTom
View on GitHub
[NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models
☆84Oct 10, 2025Updated 9 months ago
LaVi-Lab / AIM
View on GitHub
[ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"
☆65Oct 9, 2025Updated 9 months ago
daixiangzi / Awesome-Token-Compress
View on GitHub
A paper list of some recent works about Token Compress for Vit and VLM
☆944Updated this week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
cokeshao / Awesome-Multimodal-Token-Compression
View on GitHub
[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198
☆371May 29, 2026Updated last month
Theia-4869 / FasterVLM
View on GitHub
Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.
☆114Jun 29, 2025Updated last year
anakin-skywalker-Joseph / Folder
View on GitHub
Official Implementation of Paper FOLDER (ICCV2025) and Turbo (ECCV2024)
☆15Jun 27, 2025Updated last year
seilk / VisAttnSink
View on GitHub
[ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models
☆116Feb 16, 2025Updated last year
Time-Search / TimeSearch-R
View on GitHub
[ICLR 2026] Official code for paper: TimeSearch-R: Adaptive Temporal Search for Long-Form Video Understanding via Self-Verification Reinf…
☆27Jan 29, 2026Updated 5 months ago
EffiVLM-Bench / EffiVLM-Bench
View on GitHub
☆35Jun 3, 2025Updated last year
xuyang-liu16 / VidCom2
View on GitHub
[EMNLP 2025 Main] Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models
☆127May 14, 2026Updated 2 months ago
liaolea / TransPrune
View on GitHub
[CVPR 2026] TransPrune: Token Transition Pruning for Efficient Large Vision-Language Model
☆17Feb 23, 2026Updated 5 months ago
Cooperx521 / PyramidDrop
View on GitHub
(CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
☆151Mar 6, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lzhxmu / VTW
View on GitHub
Code release for VTW (AAAI 2025 Oral)
☆68Nov 4, 2025Updated 8 months ago
ZLKong / Awesome-Collection-Token-Reduction
View on GitHub
A collection of token reduction (token pruning, merging, clustering, etc.) techniques for ML/AI
☆489Updated this week
xuyang-liu16 / Awesome-Token-level-Model-Compression
View on GitHub
📚 Collection of token-level model compression resources.
☆201Sep 3, 2025Updated 10 months ago
CR400AF-A / SparseMM
View on GitHub
[ICCV 2025] SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
☆88Jan 17, 2026Updated 6 months ago
Chenfei-Liao / VTC-Bench
View on GitHub
[ACL2026 Main] Data & Code of "Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods"
☆35Apr 9, 2026Updated 3 months ago
Ironieser / MMTok
View on GitHub
[ICLR 2026] The official repo of "MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs"
☆46Jul 3, 2026Updated 3 weeks ago
pkunlp-icler / FastV
View on GitHub
[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Langua…
☆591Jan 4, 2025Updated last year
xinyouu / V-CAST
View on GitHub
V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video Large Language Models
☆34Apr 16, 2026Updated 3 months ago
nota-github / ERGO
View on GitHub
ERGO (Efficient Reasoning & Guided Observation) is a large vision-language model trained with reinforcement learning on efficiency object…
☆19Feb 25, 2026Updated 5 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Theia-4869 / MoSA
View on GitHub
Official code of MoSA (Mixture of Sparse Adapters).
☆13Dec 14, 2023Updated 2 years ago
wangqinsi1 / 2025-ICML-CoreMatching
View on GitHub
[ICML 2025] CoreMatching: Co-adaptive Sparse Inference Framework for Comprehensive Acceleration of Vision Language Model
☆16May 27, 2025Updated last year
orailix / PACT
View on GitHub
[CVPR 2025] PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models
☆60Jan 30, 2026Updated 5 months ago
hasanar1f / HiRED
View on GitHub
[AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…
☆58Apr 18, 2025Updated last year
HYUNJS / STTM
View on GitHub
[ICCV 2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs
☆61Feb 2, 2026Updated 5 months ago
lntzm / HICom
View on GitHub
[CVPR2025] Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models
☆21Apr 30, 2025Updated last year
sangminwoo / awesome-token-redundancy-reduction
View on GitHub
😎 Awesome papers on token redundancy reduction
☆14Mar 12, 2025Updated last year