Theia-4869 / CDPruner
[NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.
☆86 · Sep 20, 2025 · Updated 4 months ago
Alternatives and similar repositories for CDPruner
Users who are interested in CDPruner are comparing it to the repositories listed below:
- [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs ☆68 · Jul 1, 2025 · Updated 7 months ago
- [CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models ☆65 · Dec 1, 2025 · Updated 2 months ago
- Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models" ☆90 · Sep 10, 2025 · Updated 5 months ago
- [EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More" ☆103 · Oct 12, 2025 · Updated 4 months ago
- [EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning ☆33 · Jan 11, 2026 · Updated last month
- This repo contains the code for studying the interplay between quantization and sparsity methods ☆26 · Feb 26, 2025 · Updated 11 months ago
- [ICLR 2024] Jaiswal, A., Gan, Z., Du, X., Zhang, B., Wang, Z., & Yang, Y. Compressing LLMs: The truth is rarely pure and never simple. ☆27 · Apr 21, 2025 · Updated 9 months ago
- [ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Langua… ☆553 · Jan 4, 2025 · Updated last year
- [ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos ☆113 · Dec 12, 2025 · Updated 2 months ago
- [TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198 ☆299 · Updated this week
- Lets you step into a meta world of The Palace Museum ☆18 · Aug 30, 2025 · Updated 5 months ago
- ☆13 · Nov 30, 2024 · Updated last year
- [NeurIPS 2025] Efficient Reasoning Vision Language Models ☆448 · Sep 18, 2025 · Updated 4 months ago
- The Official Implementation of Ada-KV [NeurIPS 2025] ☆126 · Nov 26, 2025 · Updated 2 months ago
- ☆15 · Jan 3, 2024 · Updated 2 years ago
- A side project created by Lucy and me; the main idea is to use a BERT model to fuse stock news titles and past stock prices to pre… ☆10 · Dec 19, 2022 · Updated 3 years ago
- ☆12 · Feb 2, 2024 · Updated 2 years ago
- Official code for Densely-packed Object Detection via Hard Negative-Aware Anchor Attention (WACV 2022) ☆12 · Jan 6, 2022 · Updated 4 years ago
- [ICML 2025] Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions ☆14 · Sep 27, 2025 · Updated 4 months ago
- ☆16 · Apr 15, 2025 · Updated 9 months ago
- [ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology ☆73 · Jan 26, 2026 · Updated 2 weeks ago
- The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs" ☆14 · Dec 16, 2024 · Updated last year
- Web app for makeup transfer using Stable Diffusion ☆10 · Sep 11, 2023 · Updated 2 years ago
- LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussians ☆22 · Jan 10, 2025 · Updated last year
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability ☆13 · Mar 11, 2025 · Updated 11 months ago
- ☆13 · Jan 7, 2025 · Updated last year
- Dataflow-MM, multi-media operators for Dataflow. We aim to prepare data for Multimodal Large Language Models. ☆28 · Updated this week
- Hypergraph Vision Transformers: Images are More than Nodes, More than Edges ☆17 · Jul 25, 2025 · Updated 6 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models ☆17 · Nov 4, 2025 · Updated 3 months ago
- A Windows Forms application for library computer login tracking. Records employee ID and computer number locally in CSV format and upload… ☆28 · Nov 3, 2025 · Updated 3 months ago
- Reasoning in Space via Grounding in the World ☆46 · Nov 3, 2025 · Updated 3 months ago
- Code for paper: [IEEE T-IV 2024] LXL: LiDAR Excluded Lean 3D Object Detection With 4D Imaging Radar and Camera Fusion ☆23 · Jan 7, 2026 · Updated last month
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni… ☆15 · Oct 31, 2025 · Updated 3 months ago
- [NeurIPS 2025] HoPE: Hybrid of Position Embedding for Long Context Vision-Language Models ☆26 · Nov 30, 2025 · Updated 2 months ago
- [T-ITS 2024] EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving ☆12 · Jun 8, 2025 · Updated 8 months ago
- spider: A Stata package for spider plots. ☆13 · Jan 28, 2026 · Updated 2 weeks ago
- Evaluation code for Ref-L4, a new REC benchmark in the LMM era ☆56 · Dec 28, 2024 · Updated last year
- An exploration of LLM steering ☆24 · Jun 15, 2024 · Updated last year
- Video stream scaler based on FPGA and Verilog ☆15 · Mar 28, 2024 · Updated last year