thu-nics / FrameFusionLinks

[ICCV'25] The official code implementation of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"

☆51

Alternatives and similar repositories for FrameFusion

Users that are interested in FrameFusion are comparing it to the libraries listed below

Sorting:

KD-TAO / DyCoke
[CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models
☆64Updated last month
Gumpest / SparseVLMs
[ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".
☆138Updated 2 months ago
Theia-4869 / FasterVLM
Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.
☆84Updated last month
JinXins / Awesome-Token-Merge-for-MLLMs
A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.
☆69Updated 6 months ago
ChangyuanWang17 / QVLM
[NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.
☆79Updated 7 months ago
ZichenWen1 / DART
Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"
☆64Updated 3 months ago
cokeshao / HoliTom
HoliTom: Holistic Token Merging for Fast Video Large Language Models
☆38Updated 2 months ago
Cooperx521 / PyramidDrop
(CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
☆117Updated 5 months ago
42Shawn / LLaVA-PruMerge
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
☆142Updated last month
ywh187 / FitPrune
☆54Updated 3 months ago
cokeshao / Awesome-Multimodal-Token-Compression
Survey: https://arxiv.org/pdf/2507.20198
☆69Updated this week
xuyang-liu16 / Awesome-Token-level-Model-Compression
📚 Collection of token-level model compression resources.
☆147Updated last month
lzhxmu / VTW
Code release for VTW (AAAI 2025 Oral)
☆47Updated 3 weeks ago
Theia-4869 / CDPruner
Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.
☆45Updated last month
MAC-AutoML / QuoTA
This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehens…
☆73Updated 3 months ago
xuyang-liu16 / VidCom2
🚀 Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models
☆28Updated last month
ncTimTang / AKS
[CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding
☆87Updated 3 months ago
double125 / MADTP
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer
☆45Updated 11 months ago
ThisisBillhe / ZipCache
[NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
☆23Updated 4 months ago
hasanar1f / HiRED
[AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…
☆41Updated 3 months ago
Osilly / dynamic_llava
[ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…
☆49Updated 8 months ago
Visual-AI / PruneVid
The official repository for ACL2025 paper "PruneVid: Visual Token Pruning for Efficient Video Large Language Models".
☆51Updated 2 months ago
liuting20 / MustDrop
Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model
☆31Updated 7 months ago
NUS-HPC-AI-Lab / Dynamic-Tuning
The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"
☆46Updated 7 months ago
ZhangAIPI / YOPO_MLLM_Pruning
Pruning the VLLMs
☆99Updated 8 months ago
thu-nics / MBQ
The code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"
☆50Updated 4 months ago
KangarooGroup / Kangaroo
official impelmentation of Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
☆68Updated 11 months ago
MCG-NJU / p-MoD
[ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
☆41Updated last month
ModelTC / TFMQ-DM
[CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for…
☆103Updated last month
yu-rp / VisualPerceptionToken
☆93Updated 4 months ago