Danielement321 / HiPruneView external linksLinks
Implementation for HiPrune, a training-free visual token pruning method for VLM acceleration.
☆45Oct 29, 2025Updated 3 months ago
Alternatives and similar repositories for HiPrune
Users that are interested in HiPrune are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"☆54Oct 9, 2025Updated 4 months ago
- [AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…☆44Apr 18, 2025Updated 9 months ago
- The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs☆118Jul 1, 2025Updated 7 months ago
- [ICCV 2025] SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs☆82Jan 17, 2026Updated 3 weeks ago
- [EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"☆103Oct 12, 2025Updated 4 months ago
- (CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction☆141Mar 6, 2025Updated 11 months ago
- [NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification☆32Mar 30, 2025Updated 10 months ago
- [ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Langua…☆553Jan 4, 2025Updated last year
- Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"☆33Oct 12, 2024Updated last year
- [ICCV 2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs☆56Feb 2, 2026Updated last week
- [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs☆68Jul 1, 2025Updated 7 months ago
- A paper list of some recent works about Token Compress for Vit and VLM☆828Updated this week
- Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.☆106Jun 29, 2025Updated 7 months ago
- A collection of token reduction (token pruning, merging, clustering, etc.) techniques for ML/AI☆315Updated this week
- ☆10May 12, 2018Updated 7 years ago
- Code for the paper "Spatial-Temporal Multi-Cuts for Online Multiple-Camera Vehicle Tracking"☆15Apr 12, 2025Updated 10 months ago
- Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection".☆11Nov 13, 2024Updated last year
- ☆14Apr 19, 2025Updated 9 months ago
- odgt CrowdHuman dataset annotation to YOLO txt and Pascal VOC xml☆10Dec 1, 2020Updated 5 years ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 4 months ago
- See through the Dark: Learning Illumination-affined Representations for Nighttime Occupancy Prediction (NeurIPS 2025)☆25Oct 21, 2025Updated 3 months ago
- [ICML 2025] Fast and Low-Cost Genomic Foundation Models via Outlier Removal.☆17Jun 19, 2025Updated 7 months ago
- Map Optimization and Ongoing Bug-fixing mod for Cities Skylines 2☆10Dec 1, 2023Updated 2 years ago
- Repository for the code of the simplex non-negative matrix factorization algorithm for EDXS data☆14Feb 6, 2025Updated last year
- 😎 Awesome papers on token redundancy reduction☆11Mar 12, 2025Updated 11 months ago
- Source code for the paper: "Pantheon: Preemptible Multi-DNN Inference on Mobile Edge GPUs"☆15Apr 15, 2024Updated last year
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆204Jul 17, 2025Updated 6 months ago
- The official implementation of dLLM-Var☆31Nov 6, 2025Updated 3 months ago
- ☆13Apr 25, 2025Updated 9 months ago
- A handwritten Chemical Structure Image data set named EDU-CHEMC, which consists of totally 52,987 handwritten molecular structure images …☆14May 12, 2025Updated 9 months ago
- Chinese plate detection(基于YOLOv3的中文车牌检测)☆11Mar 8, 2021Updated 4 years ago
- ☆10Nov 22, 2022Updated 3 years ago
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆17Jun 19, 2025Updated 7 months ago
- 2021第三届华为云人工智能大赛 · 无人车挑战杯——车道线检测模块☆12Sep 26, 2021Updated 4 years ago
- Code release for VTW (AAAI 2025 Oral)☆64Nov 4, 2025Updated 3 months ago
- ☆64Jan 23, 2026Updated 3 weeks ago
- Yolo with jittor☆12Feb 17, 2022Updated 3 years ago
- Instruction Following Eval☆15Jan 16, 2025Updated last year
- [TCSVT2025] AVLTrack: Dynamic Sparse Learning for Aerial Vision-Language Tracking☆17Dec 1, 2025Updated 2 months ago