TerryPei/CSP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TerryPei/CSP)

TerryPei / CSP

Cross-Self KV Cache Pruning for Efficient Vision-Language Inference

☆10

Alternatives and similar repositories for CSP

Users that are interested in CSP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JCruan519 / iDAT
View on GitHub
(ICME24) This is the offical repository of iDAT: inverse Distillation Adapter-Tuning.
☆13Apr 3, 2024Updated 2 years ago
SUSTechBruce / LOOK-M
View on GitHub
[EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…
☆103Nov 9, 2024Updated last year
shirlyliu64 / ConvBench
View on GitHub
ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models
☆16Sep 27, 2024Updated last year
AIoT-MLSys-Lab / MEDA
View on GitHub
[NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference
☆22Jun 19, 2025Updated last year
IVY-LVLM / CODE
View on GitHub
Official Implementation of CODE
☆17Sep 26, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
liuzuyan / ElasticCache
View on GitHub
[ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache
☆43Jul 26, 2024Updated last year
ShawnTan86 / TokenCarve
View on GitHub
This is the open-source code for TokenCarve.
☆25Jan 23, 2026Updated 5 months ago
YuHengsss / SD-RPN
View on GitHub
[ICLR2026] Catching the Details: Self-Distilled RoI Predictors for Fine-Grained MLLM Perception
☆17Jan 26, 2026Updated 5 months ago
TerryPei / NNRetrieval
View on GitHub
ICLR2024: Neural Architecture Retrieval
☆16Mar 13, 2024Updated 2 years ago
whongzhong / MMHalSnowball
View on GitHub
Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20…
☆18Aug 12, 2024Updated last year
sled-group / moh
View on GitHub
[NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models
☆37Nov 13, 2024Updated last year
Cooperx521 / PyramidDrop
View on GitHub
(CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
☆151Mar 6, 2025Updated last year
IVY-LVLM / Counterfactual-Inception
View on GitHub
Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod…
☆20Sep 26, 2024Updated last year
li-jl16 / LORS
View on GitHub
CVPR2024 highlight.
☆13Oct 10, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
markywg / transagent
View on GitHub
[NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
☆25Oct 17, 2024Updated last year
JCruan519 / GIST
View on GitHub
(ACM MM24) This is the offical repository of GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction.
☆11Jan 28, 2024Updated 2 years ago
alibaba / EfficientAI
View on GitHub
☆48May 9, 2026Updated 2 months ago
Gumpest / MasKD
View on GitHub
Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.
☆10Mar 13, 2023Updated 3 years ago
wangyu-ustc / LVChat
View on GitHub
The official implementation of the paper **LVChat: Facilitating Long Video Comprehension**
☆14Apr 15, 2024Updated 2 years ago
Lackel / AGLA
View on GitHub
[CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
☆68Jul 16, 2024Updated 2 years ago
JoakimHaurum / ATC
View on GitHub
Official PyTorch implementation of Agglomerative Token Clustering presented at ECCV 2024
☆20Sep 19, 2024Updated last year
assasinXL / dsor_filter
View on GitHub
☆14Apr 11, 2022Updated 4 years ago
ArkadiyD / MacroNASBenchmark
View on GitHub
Benchmarks for Macro Neural Architecture Search; used and described in the paper "Local Search is a Remarkably Strong Baseline for Neural…
☆13Jul 25, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Kyyle2114 / Convolutional-Adapter-for-Segment-Anything
View on GitHub
CAD - Memory Efficient Convolutional Adapter for Segment Anything
☆12Oct 4, 2024Updated last year
pydaxing / DIRT-Deep-Learning-Enhanced-Item-Response-Theory-for-Cognitive
View on GitHub
DIRT:Deep Learning Enhanced Item Response Theory for Cognitive Diagnosis
☆25Dec 4, 2019Updated 6 years ago
bronyayang / HallE_Control
View on GitHub
HallE-Control: Controlling Object Hallucination in LMMs
☆32Apr 10, 2024Updated 2 years ago
zyxxmu / Bi-Mask
View on GitHub
Pytorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training"
☆13Jun 7, 2023Updated 3 years ago
Gumpest / AvatarKD
View on GitHub
[ACM MM'23] Official implementation of paper "Avatar Knowledge Distillation: Self-ensemble Teacher Paradigm with Uncertainty".
☆14Nov 22, 2023Updated 2 years ago
SeddonShen / NWPU_Latex_Template
View on GitHub
Latex Template for Northwestern Polytechnical University(NWPU) Report
☆17Oct 16, 2023Updated 2 years ago
MIRALab-USTC / LLM-AttentionPredictor
View on GitHub
The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Ch…
☆29Jul 15, 2025Updated last year
liuting20 / MustDrop
View on GitHub
Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model
☆36Jan 8, 2025Updated last year
aimagelab / MaPeT
View on GitHub
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
☆16Jul 1, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
aprilabdotdev / U08M11002
View on GitHub
U08M11002 信号与系统，西北工业大学
☆15Jan 19, 2024Updated 2 years ago
phdyang007 / ICCAD16-N7M2EUV
View on GitHub
EUV Layer Hotspot Detection Benchmark Suit
☆20Mar 8, 2021Updated 5 years ago
Papple-F / csg
View on GitHub
☆17Aug 8, 2024Updated last year
Vision-CAIR / Infinibench
View on GitHub
Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows
☆20Nov 4, 2025Updated 8 months ago
Linwei94 / ICML2023-DualFocalLoss
View on GitHub
Code for "Dual Focal Loss for Calibration" (ICML 2023)
☆31Apr 21, 2025Updated last year
yuranusduke / Tiled-Squeeze-and-Excitation
View on GitHub
Pytorch implementation of TSE attention
☆16Jul 9, 2021Updated 5 years ago
yangyifei729 / KVSharer
View on GitHub
Source code of paper ''KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing''
☆31Oct 24, 2024Updated last year