StargazerX0/ScaleKV

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/StargazerX0/ScaleKV)

StargazerX0 / ScaleKV

[NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression

☆52

Alternatives and similar repositories for ScaleKV

Users that are interested in ScaleKV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

VainF / TinyFusion
View on GitHub
[CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow
☆170Dec 1, 2025Updated 7 months ago
czg1225 / VeriThinker
View on GitHub
[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient
☆67Sep 27, 2025Updated 9 months ago
czg1225 / CoDe
View on GitHub
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
☆108Sep 27, 2025Updated 9 months ago
Yuanshi9815 / LiteFocus
View on GitHub
[Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.
☆34Mar 11, 2025Updated last year
csguoh / FastVAR
View on GitHub
[ICCV2025]Generate one 2K image on single 24GB 3090 GPU!
☆88Sep 8, 2025Updated 10 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yu-rp / Dimple
View on GitHub
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆117Jul 9, 2025Updated last year
horseee / dKV-Cache
View on GitHub
[NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models
☆135May 22, 2025Updated last year
VainF / In-Video-Instructions
View on GitHub
[Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control
☆45Nov 25, 2025Updated 7 months ago
Huage001 / URAE
View on GitHub
[ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".
☆118May 3, 2025Updated last year
czg1225 / dParallel
View on GitHub
[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs
☆65Apr 12, 2026Updated 3 months ago
bigglesworthnotacat / LLM-Steg
View on GitHub
[ICLR 2026 Oral] Invisible Safety Threat: Malicious Finetuning for LLM via Steganography
☆20Mar 22, 2026Updated 4 months ago
florinshen / Vista3D
View on GitHub
[ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image
☆57Sep 19, 2024Updated last year
haiquanlu / Mix-Quant
View on GitHub
☆37May 21, 2026Updated 2 months ago
horseee / CoT-Valve
View on GitHub
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆91Feb 14, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
YinBo0927 / RePro
View on GitHub
The official code of Refinement Provenance Inference: Detecting LLM-Refined Training Prompts from Model Behavior
☆22Jan 6, 2026Updated 6 months ago
G-U-N / consolver
View on GitHub
[CVPR 2026 (Highlight)] Unofficial Implementation of "Image Diffusion Preview with Consistency Solver"
☆30Jan 24, 2026Updated 5 months ago
tsa18 / ConciseHint
View on GitHub
[Preprint arXiv: 2506.18810 ] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation
☆26Oct 1, 2025Updated 9 months ago
Lexie-YU / ViFeEdit
View on GitHub
[Preprint] ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer
☆67Mar 31, 2026Updated 3 months ago
Adamdad / vico
View on GitHub
Vico: Compositional Video Generation as Flow Equalization
☆59Nov 15, 2024Updated last year
Huage001 / StyDeSty
View on GitHub
PyTorch implementation of paper "StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization" in ICML 2024.
☆16Jun 4, 2024Updated 2 years ago
INV-WZQ / SparseD
View on GitHub
[ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models
☆67Feb 22, 2026Updated 5 months ago
YujiaHu1109 / IEAP
View on GitHub
[NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models
☆118Sep 27, 2025Updated 9 months ago
Huage001 / CLEAR
View on GitHub
[NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".
☆219Sep 27, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jiahaolu97 / poison-splat
View on GitHub
(ICLR 2025 spotlight) "Poison-splat: Computation Cost Attack on 3D Gaussian Splatting"
☆78Feb 13, 2025Updated last year
jiahaolu97 / anything-unsegmentable
View on GitHub
(CVPR 2024) "Unsegment Anything by Simulating Deformation"
☆29May 27, 2024Updated 2 years ago
furiosa-ai / uncage
View on GitHub
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation
☆17Aug 12, 2025Updated 11 months ago
Yuanshi9815 / ViBT
View on GitHub
Vision Bridge Transformer at Scale
☆147Dec 1, 2025Updated 7 months ago
czg1225 / DMax
View on GitHub
DMax: Aggressive Parallel Decoding for dLLMs
☆127Jul 5, 2026Updated 2 weeks ago
TencentARC / FluxKits
View on GitHub
☆109Nov 27, 2024Updated last year
fscdc / dVoting
View on GitHub
[arXiv 2026] dVoting: Fast Voting for dLLMs
☆30Feb 13, 2026Updated 5 months ago
Jiang-Yidi / TransformerDistillation-SLU
View on GitHub
☆13Nov 25, 2021Updated 4 years ago
IamCreateAI / CycleVAR
View on GitHub
[ICCV 2025] CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation
☆18Jul 7, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
VainF / Thinkless
View on GitHub
[NeurIPS 2025] Thinkless: LLM Learns When to Think
☆261Sep 26, 2025Updated 9 months ago
NVlabs / MaskLLM
View on GitHub
[NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models
☆189Jan 1, 2025Updated last year
MaybeLizzy / PERMU
View on GitHub
☆34Oct 4, 2025Updated 9 months ago
Huage001 / LinFusion
View on GitHub
Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"
☆317Dec 23, 2024Updated last year
mit-han-lab / lpd
View on GitHub
[ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
☆104May 8, 2026Updated 2 months ago
NVlabs / HMAR
View on GitHub
[CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation
☆63Jul 8, 2025Updated last year
Lexiang-Xiong / CAD
View on GitHub
[ECCV 2026] Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models
☆28Jun 20, 2026Updated last month