xuyang-liu16/Awesome-Generation-Acceleration

📚 Collection of awesome generation acceleration resources.

☆402

Alternatives and similar repositories for Awesome-Generation-Acceleration

Users who are interested in Awesome-Generation-Acceleration are comparing it to the repositories listed below.

Shenyi-Z / ToCa
[ICLR2025] Accelerating Diffusion Transformers with Token-wise Feature Caching
☆220 · Mar 14, 2025 · Updated last year
xuyang-liu16 / Awesome-Token-level-Model-Compression
📚 Collection of token-level model compression resources.
☆200 · Sep 3, 2025 · Updated 10 months ago
Shenyi-Z / TaylorSeer
[ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers
☆406 · Mar 2, 2026 · Updated 4 months ago
xlite-dev / Awesome-DiT-Inference
📚 A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc. 🎉
☆578 · Jun 13, 2026 · Updated last month
Shenyi-Z / Cache4Diffusion
Aiming to integrate most existing feature caching-based diffusion acceleration schemes into a unified framework.
☆110 · Oct 23, 2025 · Updated 8 months ago
Tammytcl / Awesome-Diffusion-Acceleration-Cache
A curated list of research papers, resources, and advancements on Diffusion Cache and related efficient diffusion model acceleration tech…
☆86 · Nov 4, 2025 · Updated 8 months ago
pratheba / FORA
FORA introduces a simple yet effective caching mechanism in the Diffusion Transformer architecture for faster inference sampling.
☆56 · Jul 8, 2024 · Updated 2 years ago
svg-project / Sparse-VideoGen
[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention
☆693 · Jul 4, 2026 · Updated 2 weeks ago
thu-nics / DiTFastAttn
☆192 · Jan 14, 2025 · Updated last year
horseee / learning-to-cache
[NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
☆122 · Jul 15, 2024 · Updated 2 years ago
thu-nics / ViDiT-Q
[ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
☆163 · Mar 21, 2025 · Updated last year
maomaocun / dLLM-cache
Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…
☆211 · May 1, 2026 · Updated 2 months ago
hao-ai-lab / FastVideo
A unified inference and post-training framework for accelerated video generation.
☆3,865 · Updated this week
xuyang-liu16 / GlobalCom2
[AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models
☆42 · Jan 27, 2026 · Updated 5 months ago
ali-vilab / TeaCache
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
☆1,357 · Jun 8, 2025 · Updated last year
yuriYanZeXuan / EEdit
(ICCV2025) EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing
☆62 · Sep 17, 2025 · Updated 10 months ago
JIA-Lab-research / Jenga
[NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving
☆287 · Aug 4, 2025 · Updated 11 months ago
HaozheLiu-ST / T-GATE
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
☆418 · Feb 26, 2025 · Updated last year
Vchitect / FasterCache
[ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
☆263 · Dec 27, 2024 · Updated last year
vipshop / cache-dit
A PyTorch-native inference engine with cache, parallelism, quantization and CPU offload for DiTs.
☆1,234 · Updated this week
oahzxl / Awesome-Efficient-Video-Generation
A curated list of recent efficient video generation methods.
☆72 · Oct 7, 2025 · Updated 9 months ago
NUS-HPC-AI-Lab / VideoSys
VideoSys: An easy and efficient system for video generation
☆2,026 · Aug 27, 2025 · Updated 10 months ago
xuyang-liu16 / hermes-code-bridge
Use Hermes Agent as the control plane for local coding agents like Codex, Kimi Code, Claude Code, OpenCode, and Gemini CLI.
☆23 · May 28, 2026 · Updated last month
horseee / DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
☆970 · Jun 27, 2024 · Updated 2 years ago
Shenyi-Z / DuCa
(ToCa-v2) A new version of ToCa, with faster speed and better acceleration!
☆42 · Mar 13, 2025 · Updated last year
ThisisBillhe / ZipAR
[ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…
☆51 · Mar 25, 2025 · Updated last year
thu-ml / SpargeAttn
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
☆1,017 · Feb 25, 2026 · Updated 4 months ago
joelulu / Awesome-Acceleration-GenAI
Collection of Acceleration Methods for Generative AI
☆29 · Dec 9, 2025 · Updated 7 months ago
Bujiazi / DiCache
[ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache
☆61 · Jan 26, 2026 · Updated 5 months ago
lhxcs / DVD-Quant
☆17 · Oct 5, 2025 · Updated 9 months ago
thu-nics / FrameFusion
[ICCV'25] The official code of the paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"
☆76 · Jan 13, 2026 · Updated 6 months ago
NVlabs / rcm
rCM & Causal-rCM: Leading and Unified Algorithms/Infrastructures for Bidirectional/Autoregressive Video Diffusion Distillation at Scale
☆768 · Jun 25, 2026 · Updated 3 weeks ago
tianweiy / DMD2
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
☆1,404 · Mar 5, 2025 · Updated last year
AIoT-MLSys-Lab / Efficient-Diffusion-Model-Survey
[TMLR 2025] Efficient Diffusion Models: A Survey
☆185 · Dec 8, 2025 · Updated 7 months ago
ethansmith2000 / ImprovedTokenMerge
☆49 · Mar 3, 2024 · Updated 2 years ago
AdaCache-DiT / AdaCache
Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers"
☆172 · Nov 5, 2024 · Updated last year
mit-han-lab / radial-attention
[NeurIPS 2025] Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation
☆603 · Nov 11, 2025 · Updated 8 months ago
xdit-project / xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
☆2,660 · Jul 14, 2026 · Updated last week
KlingAIResearch / VMoBA
Official implementation of the paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"
☆64 · Jul 1, 2025 · Updated last year