obananas/HoloV

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/obananas/HoloV)

obananas / HoloV

[NeurIPS 2025 🔥] Official implementation for "Don't Just Chase “Highlighted Tokens” in MLLMs: Revisiting Visual Holistic Context Retention"

☆66

Alternatives and similar repositories for HoloV

Users that are interested in HoloV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Chenfei-Liao / VTC-Bench
View on GitHub
[ACL2026 Main] Data & Code of "Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods"
☆35Apr 9, 2026Updated 3 months ago
Theia-4869 / CDPruner
View on GitHub
[NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.
☆105Sep 20, 2025Updated 10 months ago
vbdi / divprune
View on GitHub
[CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models
☆86Apr 16, 2026Updated 3 months ago
EffiVLM-Bench / EffiVLM-Bench
View on GitHub
☆35Jun 3, 2025Updated last year
ZhengyaoFang / PruneSID
View on GitHub
Official code for **Prune Redundancy, Preserve Essence: Vision Token Compression in VLMs via Synergistic Importance-Diversity** (PruneSI…
☆14Mar 25, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
LunarShen / FastVID
View on GitHub
[NeurIPS 2025] FastVID: Dynamic Density Pruning for Fast Video Large Language Models
☆37Nov 10, 2025Updated 8 months ago
ZichenWen1 / DART
View on GitHub
[EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"
☆121Oct 12, 2025Updated 9 months ago
Ironieser / MMTok
View on GitHub
[ICLR 2026] The official repo of "MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs"
☆46Jul 3, 2026Updated 3 weeks ago
ZLKong / Awesome-Collection-Token-Reduction
View on GitHub
A collection of token reduction (token pruning, merging, clustering, etc.) techniques for ML/AI
☆489Updated this week
hanxunyu / VisionTrim
View on GitHub
[ICLR 2026] Official code repository for "⚡️VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration"
☆55Jun 17, 2026Updated last month
LaVi-Lab / AIM
View on GitHub
[ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"
☆65Oct 9, 2025Updated 9 months ago
HYUNJS / STTM
View on GitHub
[ICCV 2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs
☆61Feb 2, 2026Updated 5 months ago
yaolinli / TimeChat-Online
View on GitHub
[ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
☆132Jun 29, 2026Updated 3 weeks ago
cvlab-yonsei / RankMixup
View on GitHub
An official implementation of "RankMixup: Ranking-Based Mixup Training for Network Calibration" (ICCV 2023) in PyTorch.
☆11Dec 18, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Gumpest / SparseVLMs
View on GitHub
[ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".
☆267Dec 22, 2025Updated 7 months ago
7zk1014 / PanoEnv
View on GitHub
☆15Jun 21, 2026Updated last month
FightingFighting / cross-modal-information-flow-in-MLLM
View on GitHub
This is the official repository for paper: cross-modal information flow in multimodal large language models
☆44May 21, 2025Updated last year
zifuwan / ONLY
View on GitHub
[ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models
☆51Jul 7, 2025Updated last year
zhengxuJosh / AnySeg
View on GitHub
Code & Weights for “Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation”
☆15Dec 6, 2024Updated last year
JIA-Lab-research / VisionZip
View on GitHub
Official repository for VisionZip (CVPR 2025)
☆443Jul 21, 2025Updated last year
Moenupa / VTCBench
View on GitHub
Code and data for VTCBench, a VLM benchmark for long-context understanding capabilities under vision-text compression paradigm.
☆27Mar 16, 2026Updated 4 months ago
Tencent / SelfEvolvingAgent
View on GitHub
Research works from Tencent AI Lab regarding self-evolving agents
☆97Jan 30, 2026Updated 5 months ago
cokeshao / Awesome-Multimodal-Token-Compression
View on GitHub
[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198
☆371May 29, 2026Updated last month
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Danielement321 / HiPrune
View on GitHub
[ACL-2026 Findings] Implementation for HiPrune, a training-free visual token pruning method for VLM acceleration.
☆60Apr 29, 2026Updated 2 months ago
QC-LY / UiG
View on GitHub
Code for "Understanding-in-Generation:Reinforcing Generative Capability of Unified Model via Infusing Understanding into Generation"
☆15Nov 11, 2025Updated 8 months ago
cokeshao / HoliTom
View on GitHub
[NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models
☆84Oct 10, 2025Updated 9 months ago
Theia-4869 / VisPruner
View on GitHub
[ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs
☆84Jul 1, 2025Updated last year
liaolea / TransPrune
View on GitHub
[CVPR 2026] TransPrune: Token Transition Pruning for Efficient Large Vision-Language Model
☆17Feb 23, 2026Updated 5 months ago
Cooperx521 / PyramidDrop
View on GitHub
(CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
☆151Mar 6, 2025Updated last year
OpenGVLab / VKnowU
View on GitHub
[ECCV 2026] VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs
☆16Feb 3, 2026Updated 5 months ago
KD-TAO / DyCoke
View on GitHub
[CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models
☆113Nov 22, 2025Updated 8 months ago
dingyue772 / OmniSIFT
View on GitHub
[ICML2026] OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models
☆25May 21, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
CR400AF-A / SparseMM
View on GitHub
[ICCV 2025] SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
☆88Jan 17, 2026Updated 6 months ago
EmbodiedCity / NeurIPS2025-Balanced-Token-Pruning
View on GitHub
[Neurips’25] Code for the paper "Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization"
☆31Sep 25, 2025Updated 10 months ago
HVision-NKU / GlimpsePrune
View on GitHub
[TCSVT] Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"
☆99Jun 12, 2026Updated last month
MAC-AutoML / SpecEyes
View on GitHub
[ECCV 2026🔥] This is the official implementation of our paper "SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception…
☆62Apr 2, 2026Updated 3 months ago
city1517 / FlexMem
View on GitHub
[CVPR2026 Highlight] FlexMem: Scaling the Long Video Understanding of MLLMs via Visual Memory Mechanism
☆29Apr 10, 2026Updated 3 months ago
AutoLab-SAI-SJTU / AutoPrune
View on GitHub
[NeurIPS 2025] AutoPrune, a general pruning method for LLM/VLM/VLA
☆20Oct 7, 2025Updated 9 months ago
zhengxuJosh / Awesome-Streaming-Video-Avatar
View on GitHub
Awesome-Streaming-Video-Avatar
☆21Jul 4, 2026Updated 3 weeks ago