[NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.
☆103Sep 20, 2025Updated 8 months ago
Alternatives and similar repositories for CDPruner
Users that are interested in CDPruner are comparing it to the libraries listed below.
- [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs☆81Jul 1, 2025Updated 11 months ago
- [CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models☆79Apr 16, 2026Updated last month
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆32Dec 9, 2025Updated 6 months ago
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" and "Sp…☆266Dec 22, 2025Updated 5 months ago
- ☆26Jun 5, 2025Updated last year
- [TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198☆362May 29, 2026Updated 2 weeks ago
- Awesome Remote Sensing Vision-Language Datasets☆86Jun 8, 2026Updated last week
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆30May 27, 2026Updated 2 weeks ago
- [EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"☆120Oct 12, 2025Updated 8 months ago
- Efficient and Online Dataset Growth Algorithm (with cleanness and diversity awareness) to deal with growing web data☆20Aug 6, 2024Updated last year
- This repo contains the code for studying the interplay between quantization and sparsity methods☆26Feb 26, 2025Updated last year
- [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring☆281Jul 6, 2025Updated 11 months ago
- [ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Langua…☆581Jan 4, 2025Updated last year
- [ICCV 2025] Improving 3D Large Language Model via Robust Instruction Tuning☆71Oct 19, 2025Updated 7 months ago
- A token pruning method that accelerates ViTs for various tasks while maintaining high performance.☆28Jul 21, 2025Updated 10 months ago
- [ECCV 2024] PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance☆23Jul 25, 2024Updated last year
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025☆35Feb 22, 2026Updated 3 months ago
- [NeurIPS'25] Code for the paper "Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization"☆32Sep 25, 2025Updated 8 months ago
- ☆23May 18, 2025Updated last year
- ☆70Jun 1, 2025Updated last year
- This is the repository for the source code of the paper "Structure-Aware Single-Source Generalization with Pixel-Level Disentanglement fo…☆18Dec 22, 2024Updated last year
- ☆83Oct 13, 2025Updated 8 months ago
- [ICLR26] Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling☆195Jan 26, 2026Updated 4 months ago
- [CVPR 2025] Official implementation for the paper "Towards Understanding How Knowledge Evolves in Large Vision-Language Models"☆32Apr 10, 2025Updated last year
- ☆10Dec 25, 2023Updated 2 years ago
- Code release for VTW (AAAI 2025 Oral)☆68Nov 4, 2025Updated 7 months ago
- [NeurIPS 2025] Efficient Reasoning Vision Language Models☆458Sep 18, 2025Updated 8 months ago
- [ICML 2025] Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions☆14Mar 7, 2026Updated 3 months ago
- Official code of MoSA (Mixture of Sparse Adapters).☆13Dec 14, 2023Updated 2 years ago
- Code for paper: [IEEE T-IV 2024] LXL: LiDAR Excluded Lean 3D Object Detection With 4D Imaging Radar and Camera Fusion☆26Jan 7, 2026Updated 5 months ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆81Feb 27, 2026Updated 3 months ago
- ☆14Mar 5, 2026Updated 3 months ago
- ☆77Apr 9, 2026Updated 2 months ago
- ☆12Apr 16, 2024Updated 2 years ago
- ☆55Oct 3, 2024Updated last year
- The official implementation of "Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings"☆18Dec 5, 2024Updated last year
- ☆11May 19, 2025Updated last year
- Used for automatic course evaluation at UCAS (University of Chinese Academy of Sciences).☆14Apr 23, 2024Updated 2 years ago
- ☆35Dec 14, 2025Updated 6 months ago