joelulu/Awesome-Acceleration-GenAI

Collection of Acceleration Methods for Generative AI

☆29

Alternatives and similar repositories for Awesome-Acceleration-GenAI

Users who are interested in Awesome-Acceleration-GenAI are comparing it to the repositories listed below.

UnicomAI / LeMiCa
[NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation
☆122 · Jun 22, 2026 · Updated last month

Tammytcl / Awesome-Diffusion-Acceleration-Cache
A curated list of research papers, resources, and advancements on Diffusion Cache and related efficient diffusion model acceleration tech…
☆86 · Updated this week

UnicomAI / ShortDF
Optimizing for the Shortest Path in Denoising Diffusion Model (CVPR2025)
☆20 · Dec 17, 2025 · Updated 7 months ago

UnicomAI / MeanCache
[ICLR 2026] MeanCache: From Instantaneous to Average Velocity for Accelerating Flow Matching Inference
☆33 · Feb 5, 2026 · Updated 5 months ago

UnicomAI / HiMo-CLIP
[AAAI 2026 Oral] HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment
☆29 · Dec 17, 2025 · Updated 7 months ago

UnicomAI / CoTj
CoTj (Chain-of-Trajectories) upgrades diffusion models from fixed System-1 denoising schedules to System-2 style, condition-adaptive traj…
☆23 · Mar 24, 2026 · Updated 4 months ago

xlite-dev / qwen-image-fast
⚡️Qwen-Image 4.8x🎉 speedup with Hybrid Acceleration for low VRAM GPUs
☆17 · Oct 24, 2025 · Updated 9 months ago

flash-bon / flash-bon
(ECCV 2026): Official code for Flash-BoN: Instant Drafts for Inference-Time Scaling in Diffusion Models
☆18 · Jul 9, 2026 · Updated 2 weeks ago

bytedance / ERTACache
☆25 · Sep 4, 2025 · Updated 10 months ago

CSU-JPG / Glance
[ECCV 2026] Glance: Accelerating Diffusion Models with 1 Sample
☆155 · Updated this week

HiDream-ai / DreamJourney
[TMM 2025] Official Implementation of DreamJourney: Perpetual View Generation with Video Diffusion Models
☆18 · Jun 24, 2025 · Updated last year

arielshaulov / TokenTrim
Official implementation of the paper "TOKENTRIM: INFERENCE-TIME TOKEN PRUNING FOR AUTOREGRESSIVE LONG VIDEO GENERATION"
☆15 · Feb 8, 2026 · Updated 5 months ago

mikeallen39 / FlowCache
[ICLR2026] The open-source code for FlowCache, including accelerated implementations of the MAGI-1 and Skyreels-V2.
☆30 · Apr 24, 2026 · Updated 3 months ago

Doraemonzzz / xmixers
Xmixers: A collection of SOTA efficient token/channel mixers
☆29 · Sep 4, 2025 · Updated 10 months ago

vipshop / cache-dit
A PyTorch-native inference engine with cache, parallelism, quantization and cpu offload for DiTs.
☆1,239 · Updated this week

ModelTC / QVGen
[ICLR 2026] This is the official PyTorch implementation of "QVGen: Pushing the Limit of Quantized Video Generative Models".
☆32 · Feb 11, 2026 · Updated 5 months ago

Tencent-Hunyuan / DisCa
DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching
☆24 · Apr 15, 2026 · Updated 3 months ago

Orange-3DV-Team / SmartDirector
☆25 · May 29, 2026 · Updated 2 months ago

0xWelt / VibeRL
VibeRL is a Reinforcement Learning framework built essentially through vibe coding with Kimi K2.
☆17 · Jul 20, 2026 · Updated last week

xuyang-liu16 / Awesome-Generation-Acceleration
📚 Collection of awesome generation acceleration resources.
☆402 · Jul 7, 2025 · Updated last year

leeruibin / hybrid-forcing
☆32 · Apr 29, 2026 · Updated 3 months ago

pnotp / ArcFlow
ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation
☆128 · May 20, 2026 · Updated 2 months ago

ModelTC / GenRL
Reinforcement Learning Framework for Visual Generation
☆126 · Feb 13, 2026 · Updated 5 months ago

Yifei-Zuo / FlashLLA
Official repository Flash Local Linear Attention
☆38 · May 28, 2026 · Updated 2 months ago

suimuc / MTV_Framework
☆23 · Oct 15, 2025 · Updated 9 months ago

ziplab / Pyramid-Sparse-Attention
Official PyTorch implementation of [PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation](https://arxiv.org/abs…
☆25 · Jan 25, 2026 · Updated 6 months ago

xlite-dev / longcat-video-fast
🔥LongCat-Video 1.7x🎉 speedup: cache acceleration and 4/8-bits weight only.
☆15 · Oct 28, 2025 · Updated 9 months ago

lhxcs / DVD-Quant
☆17 · Oct 5, 2025 · Updated 9 months ago

hyj542682306 / Semantic-Frame-Interpolation
☆21 · Jul 8, 2025 · Updated last year

Trans-Diff / TransDiff
☆20 · Aug 1, 2025 · Updated 11 months ago

AniAggarwal / ecad
[ICLR 2026] Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
☆30 · Mar 1, 2026 · Updated 4 months ago

zishen-ucap / PromptTea
This repository contains the official implementation of our paper: PromptTea: Let Prompts Tell TeaCache the Optimal Threshold
☆35 · Oct 27, 2025 · Updated 9 months ago

Shenyi-Z / Cache4Diffusion
Aiming to integrate most existing feature caching-based diffusion acceleration schemes into a unified framework.
☆110 · Oct 23, 2025 · Updated 9 months ago

Westlake-AGI-Lab / FlowDirector
Official PyTorch implementation of the paper "FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing"
☆89 · Dec 12, 2025 · Updated 7 months ago

feifeibear / SeeReel
Agent-native Seedance 2.0 short-film studio: cli for AI, canvas for human
☆16 · Jun 14, 2026 · Updated last month

Zehong-Ma / MagCache
The official code for NeurIPS 2025 "MagCache: Fast Video Generation with Magnitude-Aware Cache"
☆275 · Nov 17, 2025 · Updated 8 months ago

HelloZicky / NaviCache
[ICML 2026] Official implementation of "NaviCache: Test-Time Self-Calibration Caching for Video Generation".
☆27 · Jul 9, 2026 · Updated 3 weeks ago

LINs-lab / UCGM
[Preprint] UCGM: Unified Continuous Generative Models
☆185 · May 27, 2025 · Updated last year

kandinskylab / kandinsky-5-lora-train
☆20 · Nov 25, 2025 · Updated 8 months ago