xuyang-liu16/V2Drop

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xuyang-liu16/V2Drop)

xuyang-liu16 / V2Drop

[CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models

☆34

Alternatives and similar repositories for V2Drop

Users that are interested in V2Drop are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xuyang-liu16 / GlobalCom2
View on GitHub
[AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models
☆42Jan 27, 2026Updated 5 months ago
xuyang-liu16 / VidCom2
View on GitHub
[EMNLP 2025 Main] Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models
☆127May 14, 2026Updated 2 months ago
xuyang-liu16 / MixKV
View on GitHub
[ICLR 2026] Mixing Importance with Diversity: Joint Optimization for KV Cache Compression in Large Vision-Language Models
☆29Mar 21, 2026Updated 4 months ago
Da1yuqin / TCDiff
View on GitHub
Official code for our AAAI25 oral👑 paper Harmonious Group Choreography with Trajectory-Controllable Diffusion — hope you enjoy exploring…
☆20Oct 3, 2025Updated 9 months ago
xuyang-liu16 / Awesome-Token-level-Model-Compression
View on GitHub
📚 Collection of token-level model compression resources.
☆201Sep 3, 2025Updated 10 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
GuoqingWang1 / WebFilter
View on GitHub
🌟Official code of our AAAI26 paper 🔍WebFilter
☆40Nov 9, 2025Updated 8 months ago
Da1yuqin / SEAD
View on GitHub
[ACL26-Findings]💁📲 Self-evolving customer service framework, SEAD, operates without any human-labeled data. It can be quickly launched…
☆23May 13, 2026Updated 2 months ago
Da1yuqin / TCDiffpp
View on GitHub
🌟This is the official code for our IJCV25 paper TCDiff++ 💃💃💃
☆26Apr 30, 2026Updated 2 months ago
xuyang-liu16 / VGDiffZero
View on GitHub
[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders
☆17Feb 11, 2025Updated last year
DabDans / AudioMarathon
View on GitHub
Code for "AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs"
☆26Oct 9, 2025Updated 9 months ago
anakin-skywalker-Joseph / Folder
View on GitHub
Official Implementation of Paper FOLDER (ICCV2025) and Turbo (ECCV2024)
☆15Jun 27, 2025Updated last year
liuting20 / DARA
View on GitHub
[ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
☆22Feb 26, 2025Updated last year
EffiVLM-Bench / EffiVLM-Bench
View on GitHub
☆35Jun 3, 2025Updated last year
AutoLab-SAI-SJTU / AutoPrune
View on GitHub
[NeurIPS 2025] AutoPrune, a general pruning method for LLM/VLM/VLA
☆20Oct 7, 2025Updated 9 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
AIoT-MLSys-Lab / MEDA
View on GitHub
[NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference
☆22Jun 19, 2025Updated last year
Moenupa / VTCBench
View on GitHub
Code and data for VTCBench, a VLM benchmark for long-context understanding capabilities under vision-text compression paradigm.
☆27Mar 16, 2026Updated 4 months ago
ZichenWen1 / DART
View on GitHub
[EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"
☆121Oct 12, 2025Updated 9 months ago
kawhiiiileo / FiCoCo
View on GitHub
[AAAI 26'] This is the official pytorch implementation for paper: Filter, Correlate, Compress: Training-Free Token Reduction for MLLM Acc…
☆47Nov 13, 2025Updated 8 months ago
lern-to-write / STC
View on GitHub
[CVPR 2026] Accelerating Streaming Video Large Language Models via Hierarchical Token Compression
☆70Jun 8, 2026Updated last month
AshleyLuo001 / UTANet
View on GitHub
[AAAI 2025] Open-source, End-to-end, Medical Image Segmentation model by Task allociation.
☆39May 22, 2025Updated last year
LowLevelAI / GPP-LLIE
View on GitHub
Official implementation of GPP-LLIE, which is accpeted by AAAI 2025.
☆38Dec 13, 2025Updated 7 months ago
Visual-AI / PruneVid
View on GitHub
[ACL 2025] PruneVid: Visual Token Pruning for Efficient Video Large Language Models
☆72May 15, 2025Updated last year
ZichenWen1 / EPIC
View on GitHub
(NeurIPS 2025 🔥) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation"
☆49Feb 11, 2026Updated 5 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
qxzha / UGNCL
View on GitHub
Uncertainty-Guided Noisy Correspondence Learning for Efficient Cross-Modal Matching (ACM SIGIR 2024, Pytorch Code)
☆22Apr 16, 2026Updated 3 months ago
Chenfei-Liao / VTC-Bench
View on GitHub
[ACL2026 Main] Data & Code of "Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods"
☆35Apr 9, 2026Updated 3 months ago
ucas-xiang / QIG
View on GitHub
[CVPR 2026] Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients
☆23Jun 21, 2026Updated last month
Disguiser15 / RefTeacher
View on GitHub
RefTeacher is a strong baseline method for Semi-Supervised Referring Expression Comprehension.
☆14May 26, 2023Updated 3 years ago
TungChintao / FlowCut
View on GitHub
[NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”
☆32Dec 9, 2025Updated 7 months ago
OPPO-Mente-Lab / PixelPrune
View on GitHub
PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding
☆28Jun 10, 2026Updated last month
Hokhim2 / CVBench
View on GitHub
☆19Aug 28, 2025Updated 10 months ago
FFY0 / AdaKV
View on GitHub
The Official Implementation of Ada-KV [NeurIPS 2025]
☆139Nov 26, 2025Updated 8 months ago
EnVision-Research / PAP
View on GitHub
Panoramic Affordance Prediction (PAP) (ECCV 2026)
☆46Jun 29, 2026Updated 3 weeks ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
InnovatorLM / Innovator-VL
View on GitHub
Fully Open-source Multimodal Language Models for Science Discovery
☆168Mar 20, 2026Updated 4 months ago
huangmozhi9527 / GMMFormer
View on GitHub
[AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
☆21May 10, 2024Updated 2 years ago
JIA-Lab-research / VisionZip
View on GitHub
Official repository for VisionZip (CVPR 2025)
☆443Jul 21, 2025Updated last year
yaolinli / TimeChat-Online
View on GitHub
[ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
☆132Jun 29, 2026Updated 3 weeks ago
dingyue772 / OmniSIFT
View on GitHub
[ICML2026] OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models
☆25May 21, 2026Updated 2 months ago
w-yibo / VTC-R1
View on GitHub
VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning.
☆26Updated this week
ZichenWen1 / DIJA
View on GitHub
(ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"
☆79Feb 9, 2026Updated 5 months ago