KD-TAO/VidKV

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KD-TAO/VidKV)

KD-TAO / VidKV

VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models

☆25

Alternatives and similar repositories for VidKV

Users that are interested in VidKV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KD-TAO / DyCoke
View on GitHub
[CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models
☆114Nov 22, 2025Updated 8 months ago
heliossun / STLLaVA-Med
View on GitHub
Self-training LLaVA for medical
☆16Nov 3, 2024Updated last year
CFinTech / SparseSSM
View on GitHub
[arxiv 2025] SparseSSM: Efficient Selective Structured State Space Models Can Be Pruned in One-Shot
☆22Oct 8, 2025Updated 9 months ago
yinyueqin / DenseRewardRLHF-PPO
View on GitHub
This repository contains the code and released models for the paper Segmenting Text and Learning Their Rewards for Improved RLHF in Langu…
☆19Jan 8, 2025Updated last year
salesforce / HIVE
View on GitHub
☆121Jun 2, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
wanglichenxj / Dual-Relation-Semi-supervised-Multi-label-Learning
View on GitHub
☆23Sep 3, 2020Updated 5 years ago
SalesforceAIResearch / ThinK
View on GitHub
ThinK: Thinner Key Cache by Query-Driven Pruning
☆30Jun 2, 2026Updated last month
cokeshao / HoliTom
View on GitHub
[NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models
☆84Oct 10, 2025Updated 9 months ago
heliossun / SQ-LLaVA
View on GitHub
Visual self-questioning for large vision-language assistant.
☆44Jul 23, 2025Updated last year
MingSun-Tse / Regularization-Pruning
View on GitHub
[ICLR'21] Neural Pruning via Growing Regularization (PyTorch)
☆82Jul 15, 2021Updated 5 years ago
MingSun-Tse / Why-the-State-of-Pruning-so-Confusing
View on GitHub
[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…
☆41Sep 9, 2025Updated 10 months ago
MingSun-Tse / Good-DA-in-KD
View on GitHub
[NeurIPS'22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective
☆37Dec 15, 2022Updated 3 years ago
alopezgit / DESC
View on GitHub
PyTorch implementation for DESC - BMVC20 (Oral) & IJCV22
☆17Dec 23, 2022Updated 3 years ago
Visual-AI / PruneVid
View on GitHub
[ACL 2025] PruneVid: Visual Token Pruning for Efficient Video Large Language Models
☆72May 15, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
mlvlab / ST-VLM
View on GitHub
☆13Mar 28, 2025Updated last year
krafton-ai / lexico
View on GitHub
KV cache compression via sparse coding
☆17Oct 26, 2025Updated 9 months ago
KD-TAO / LVOmniBench
View on GitHub
LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs
☆41Apr 2, 2026Updated 3 months ago
kyunghyuncho / jax-practice
View on GitHub
☆13Aug 17, 2020Updated 5 years ago
G-JWLee / TAMP
View on GitHub
☆12May 15, 2025Updated last year
MingSun-Tse / ASSL
View on GitHub
[NeurIPS'21 Spotlight] Aligned Structured Sparsity Learning for Efficient Image Super-Resolution (PyTorch)
☆61Apr 5, 2022Updated 4 years ago
hilbert9221 / NRI-MPM
View on GitHub
Code for Neural Relational Inference with Efficient Message Passing Mechanisms (AAAI 2021).
☆21May 9, 2021Updated 5 years ago
viridisGreen / EarlyTom
View on GitHub
[CVPR 2026] EarlyTom: Early Token Compression Completes Fast Video Understanding
☆34Jun 22, 2026Updated last month
johndpope / ltx2-castlehill
View on GitHub
CastleHill: Separable Causal Diffusion / Varitaion Flow Maps for LTX-2 long-form video generation
☆15May 19, 2026Updated 2 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
THU-MIG / PrefixKV
View on GitHub
PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation [NeurIPS 2025]
☆19Oct 11, 2025Updated 9 months ago
leloykun / steepest-descent-lean
View on GitHub
Deriving steepest descent convergence bounds and hyperparameter scaling laws in machine learning optimization from first principles, form…
☆16Apr 11, 2026Updated 3 months ago
uncbiag / UniLMMV
View on GitHub
☆11Mar 25, 2024Updated 2 years ago
mlvlab / DialogGSR
View on GitHub
Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main…
☆12Mar 10, 2025Updated last year
wyzjack / SLA2P
View on GitHub
[TKDE 2024, CIKM 2022] SLA²P: Self-supervised Anomaly Detection with Adversarial Perturbation.
☆39Dec 26, 2024Updated last year
xie-lab-ml / Mano-Restriking-Manifold-Optimization-for-LLM-Training
View on GitHub
The official code of "Mano: Restriking Manifold Optimization for LLM Training".
☆25Jun 1, 2026Updated last month
AMD-AGI / Gumiho
View on GitHub
Official Implementation of "Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding" (ICML'25)
☆36May 14, 2026Updated 2 months ago
CUC-MIPG / UnifyEdit
View on GitHub
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model
☆13Dec 29, 2024Updated last year
snap-research / efficient-nn-tutorial
View on GitHub
Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments
☆12Jun 30, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
salesforce / UniControl
View on GitHub
Unified Controllable Visual Generation Model
☆662Jun 2, 2026Updated last month
chenpipi0807 / LTX-Video-Trainer-GUI
View on GitHub
LTX-Video-Trainer-GUI 是为LTX视频lora模型训练提供的GUI工具，支持通过简单的界面训练 LoRA 模型用于视频生成。本训练器提供了直观的 GUI 界面，使用户能够轻松设置和启动训练流程，无需编写复杂代码。
☆13Jul 18, 2025Updated last year
wyzjack / MRMGA4VAD
View on GitHub
[ICDM 2022] Making Reconstruction-based Method Great Again for Video Anomaly Detection (PyTorch)
☆40Mar 25, 2024Updated 2 years ago
VinAIResearch / HyperCUT
View on GitHub
HyperCUT: Video Sequence from a Single Blurry Image using Unsupervised Ordering (CVPR'23)
☆14Nov 4, 2025Updated 8 months ago
SagiPolaczek / Sync-LoRA
View on GitHub
Official implementation of Sync-LoRA
☆27Jun 25, 2026Updated last month
lldacing / ComfyUI_StableDelight_ll
View on GitHub
☆14Apr 8, 2025Updated last year
Orange-3DV-Team / SmartDirector
View on GitHub
☆25May 29, 2026Updated 2 months ago