Theia-4869 / CDPruner
[NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.
☆86Sep 20, 2025Updated 4 months ago
Alternatives and similar repositories for CDPruner
Users who are interested in CDPruner are comparing it to the repositories listed below
- [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs☆68Jul 1, 2025Updated 7 months ago
- [CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models☆65Dec 1, 2025Updated 2 months ago
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆28Dec 9, 2025Updated 2 months ago
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" and "Sp…☆241Dec 22, 2025Updated last month
- ☆22Jun 5, 2025Updated 8 months ago
- [EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"☆103Oct 12, 2025Updated 4 months ago
- [EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning☆33Jan 11, 2026Updated last month
- This repo contains the code for studying the interplay between quantization and sparsity methods☆26Feb 26, 2025Updated 11 months ago
- [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring☆269Jul 6, 2025Updated 7 months ago
- [TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198☆299Updated this week
- [ICLR 2024] Jaiswal, A., Gan, Z., Du, X., Zhang, B., Wang, Z., & Yang, Y. Compressing llms: The truth is rarely pure and never simple.☆27Apr 21, 2025Updated 9 months ago
- [ICCV 2025] Improving 3D Large Language Model via Robust Instruction Tuning☆68Oct 19, 2025Updated 3 months ago
- [ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Langua…☆553Jan 4, 2025Updated last year
- [ICCV 2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs☆56Feb 2, 2026Updated last week
- ☆21May 18, 2025Updated 8 months ago
- [ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos☆113Dec 12, 2025Updated 2 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆47Aug 1, 2025Updated 6 months ago
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks☆11Jun 19, 2025Updated 7 months ago
- This is the repository for the source code of the paper "Structure-Aware Single-Source Generalization with Pixel-Level Disentanglement fo…☆19Dec 22, 2024Updated last year
- ☆13Nov 30, 2024Updated last year
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆48Apr 21, 2025Updated 9 months ago
- A paper list of some recent works on token compression for ViT and VLM☆828Updated this week
- The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- 😎 Awesome papers on token redundancy reduction☆11Mar 12, 2025Updated 11 months ago
- [ICML 2025] Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions☆14Sep 27, 2025Updated 4 months ago
- ☆12Feb 2, 2024Updated 2 years ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆17Nov 4, 2025Updated 3 months ago
- ☆13Jan 7, 2025Updated last year
- [ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology☆73Jan 26, 2026Updated 2 weeks ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆13Mar 11, 2025Updated 11 months ago
- LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussians☆22Jan 10, 2025Updated last year
- Single-frame point cloud (Euclidean distance clustering segmentation) + Yolo_v2 (GPU) / MobileNet_SSD (VPU, NCS accelerator stick) object detection☆10Dec 13, 2018Updated 7 years ago
- Code for CVPR2021 Paper “Cascaded Prediction Network via Segment Tree for Temporal Video Grounding”☆10Apr 3, 2022Updated 3 years ago
- [ICML'25] The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products☆15Jul 16, 2025Updated 6 months ago
- [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"☆22Nov 23, 2025Updated 2 months ago
- ☆11Mar 14, 2024Updated last year
- ☆10Oct 20, 2023Updated 2 years ago
- Code for paper: [IEEE T-IV 2024] LXL: LiDAR Excluded Lean 3D Object Detection With 4D Imaging Radar and Camera Fusion☆23Jan 7, 2026Updated last month
- video stream scaler based on FPGA and verilog☆15Mar 28, 2024Updated last year