kawhiiiileo/FiCoCo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kawhiiiileo/FiCoCo)

kawhiiiileo / FiCoCo

[AAAI 26'] This is the official pytorch implementation for paper: Filter, Correlate, Compress: Training-Free Token Reduction for MLLM Acceleration

☆47

Alternatives and similar repositories for FiCoCo

Users that are interested in FiCoCo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xuyang-liu16 / GlobalCom2
View on GitHub
[AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models
☆42Jan 27, 2026Updated 6 months ago
xuyang-liu16 / V2Drop
View on GitHub
[CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models
☆34May 27, 2026Updated 2 months ago
hanxunyu / VisionTrim
View on GitHub
[ICLR 2026] Official code repository for "⚡️VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration"
☆56Jun 17, 2026Updated last month
xuyang-liu16 / MixKV
View on GitHub
[ICLR 2026] Mixing Importance with Diversity: Joint Optimization for KV Cache Compression in Large Vision-Language Models
☆29Mar 21, 2026Updated 4 months ago
MILVLG / twigvlm
View on GitHub
Implementation of ICCV 2025 paper "Growing a Twig to Accelerate Large Vision-Language Models".
☆30May 23, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zifuwan / ONLY
View on GitHub
[ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models
☆51Jul 7, 2025Updated last year
AIoT-MLSys-Lab / MEDA
View on GitHub
[NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference
☆22Jun 19, 2025Updated last year
EffiVLM-Bench / EffiVLM-Bench
View on GitHub
☆35Jun 3, 2025Updated last year
lern-to-write / STC
View on GitHub
[CVPR 2026] Accelerating Streaming Video Large Language Models via Hierarchical Token Compression
☆70Jun 8, 2026Updated last month
SuDIS-ZJU / Efficient-LVLMs-Inference
View on GitHub
[ACL 2026 Findings] Living repository for the survey paper “Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques…
☆26Apr 8, 2026Updated 3 months ago
zju-jiyicheng / LVSpec
View on GitHub
[ACL 2026 Main] See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference of Video …
☆27Jul 4, 2026Updated 3 weeks ago
Lou1sM / meaningful_image_complexity
View on GitHub
☆17Mar 24, 2025Updated last year
Linking-ai / SCOPE
View on GitHub
(ACL2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation
☆36May 28, 2025Updated last year
cokeshao / HoliTom
View on GitHub
[NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models
☆84Oct 10, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
lzhxmu / VTW
View on GitHub
Code release for VTW (AAAI 2025 Oral)
☆68Nov 4, 2025Updated 8 months ago
Theia-4869 / FasterVLM
View on GitHub
Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.
☆114Jun 29, 2025Updated last year
JinXins / MergeMix
View on GitHub
[ICLR 2026] MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding
☆21Feb 27, 2026Updated 5 months ago
hulianyuyy / iLLaVA
View on GitHub
iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)
☆23Jun 24, 2026Updated last month
THU-MIG / PrefixKV
View on GitHub
PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation [NeurIPS 2025]
☆19Oct 11, 2025Updated 9 months ago
THU-MIG / VTC-CLS
View on GitHub
official repo for paper "[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs"
☆22Apr 23, 2025Updated last year
xuyang-liu16 / Awesome-Token-level-Model-Compression
View on GitHub
📚 Collection of token-level model compression resources.
☆201Sep 3, 2025Updated 10 months ago
Ironieser / MMTok
View on GitHub
[ICLR 2026] The official repo of "MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs"
☆46Jul 3, 2026Updated 3 weeks ago
liuzuyan / ElasticCache
View on GitHub
[ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache
☆43Jul 26, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
nota-github / ERGO
View on GitHub
ERGO (Efficient Reasoning & Guided Observation) is a large vision-language model trained with reinforcement learning on efficiency object…
☆19Feb 25, 2026Updated 5 months ago
aiha-lab / InfiniPot-V
View on GitHub
[NeurIPS 25] InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding
☆20Jan 25, 2026Updated 6 months ago
kuijiang94 / DAWN
View on GitHub
DAWN: Direction-aware Attention Wavelet Network for Image Deraining
☆11Jan 7, 2024Updated 2 years ago
gogoczh / CoMT
View on GitHub
code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"
☆19Mar 10, 2025Updated last year
MilkThink-Lab / MiniLongBench
View on GitHub
[ACL 25] The Low-cost Long Context Understanding Benchmark for Large Language Models (Outstanding Paper Award)
☆23Jul 30, 2025Updated 11 months ago
xuyang-liu16 / VGDiffZero
View on GitHub
[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders
☆17Feb 11, 2025Updated last year
YIGE24 / StreamingTOM
View on GitHub
☆27Mar 5, 2026Updated 4 months ago
FFY0 / AdaKV
View on GitHub
The Official Implementation of Ada-KV [NeurIPS 2025]
☆139Nov 26, 2025Updated 8 months ago
JulietChoo / VisionSelector
View on GitHub
VisionSelector: End-to-End Learnable Visual Token Compression for Efficient Multimodal LLMs
☆65Mar 24, 2026Updated 4 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Cooperx521 / PyramidDrop
View on GitHub
(CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
☆151Mar 6, 2025Updated last year
vbdi / divprune
View on GitHub
[CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models
☆86Apr 16, 2026Updated 3 months ago
LivXue / VCIN
View on GitHub
Authors's code for "Variational Causal Inference Network for Explanatory Visual Question Answering" and "Integrating Neural-Symbolic Reas…
☆13Apr 13, 2026Updated 3 months ago
codefanw / FlashSloth
View on GitHub
[CVPR2025] FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression
☆64Oct 10, 2025Updated 9 months ago
CR400AF-A / SparseMM
View on GitHub
[ICCV 2025] SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
☆88Jan 17, 2026Updated 6 months ago
viridisGreen / EarlyTom
View on GitHub
[CVPR 2026] EarlyTom: Early Token Compression Completes Fast Video Understanding
☆34Jun 22, 2026Updated last month
Visual-AI / PruneVid
View on GitHub
[ACL 2025] PruneVid: Visual Token Pruning for Efficient Video Large Language Models
☆72May 15, 2025Updated last year