huggingface/flux-fast

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/huggingface/flux-fast)

huggingface / flux-fast

Making Flux go brrr on GPUs.

☆170

Alternatives and similar repositories for flux-fast

Users that are interested in flux-fast are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xlite-dev / flux-faster
View on GitHub
A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20.
☆24Jul 18, 2025Updated last year
huggingface / lora-fast
View on GitHub
Minimal repository to demonstrate fast LoRA inference with Flux family of models.
☆32Jul 23, 2025Updated last year
sayakpaul / diffusers-torchao
View on GitHub
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
☆399Jan 8, 2026Updated 6 months ago
vipshop / cache-dit
View on GitHub
A PyTorch-native inference engine with cache, parallelism, quantization and cpu offload for DiTs.
☆1,239Updated this week
chengzeyi / ParaAttention
View on GitHub
https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching
☆427Jul 5, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
hao-ai-lab / Awesome-Video-Attention
View on GitHub
A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and cach…
☆61Oct 27, 2025Updated 9 months ago
G-U-N / consolver
View on GitHub
[CVPR 2026 (Highlight)] Unofficial Implementation of "Image Diffusion Preview with Consistency Solver"
☆31Jan 24, 2026Updated 6 months ago
a-r-r-o-w / productionizing-diffusion
View on GitHub
Optimizing diffusion for production-ready speeds
☆40Jan 10, 2026Updated 6 months ago
fal-ai / flashpack
View on GitHub
High-throughput tensor loading for PyTorch
☆260Updated this week
shawnricecake / draft-attention
View on GitHub
Code for Draft Attention
☆103May 22, 2025Updated last year
huggingface / diffusion-fast
View on GitHub
Faster generation with text-to-image diffusion models.
☆234Jun 28, 2025Updated last year
huggingface / kernel-builder
View on GitHub
👷 Build compute kernels
☆213Apr 6, 2026Updated 3 months ago
mit-han-lab / radial-attention
View on GitHub
[NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation
☆604Nov 11, 2025Updated 8 months ago
chengzeyi / stable-fast
View on GitHub
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
☆1,304Mar 27, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sayakpaul / q8-ltx-video
View on GitHub
This repository shows how to use Q8 kernels with `diffusers` to optimize inference of LTX-Video on ADA GPUs.
☆25Jan 7, 2025Updated last year
nunchaku-ai / nunchaku
View on GitHub
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
☆3,920Mar 7, 2026Updated 4 months ago
Shenyi-Z / TaylorSeer
View on GitHub
[ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers
☆408Mar 2, 2026Updated 4 months ago
sandyresearch / chipmunk
View on GitHub
🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× …
☆111Sep 8, 2025Updated 10 months ago
maxin-cn / OmniPainter
View on GitHub
Training-free Stylized Text-to-Image Generation with Fast Inference
☆28May 30, 2025Updated last year
hao-ai-lab / FastVideo
View on GitHub
A unified inference and post-training framework for accelerated video generation.
☆3,888Updated this week
xlite-dev / Awesome-DiT-Inference
View on GitHub
📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉
☆579Jun 13, 2026Updated last month
NVlabs / FastGen
View on GitHub
NVIDIA FastGen: Fast Generation from Diffusion Models
☆871Updated this week
Zehong-Ma / MagCache
View on GitHub
The official code for NeurIPS 2025 "MagCache: Fast Video Generation with Magnitude-Aware Cache"
☆275Nov 17, 2025Updated 8 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
thu-ml / SpargeAttn
View on GitHub
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
☆1,019Feb 25, 2026Updated 5 months ago
huggingface / kernels
View on GitHub
Build compute kernels and load them from the Hub.
☆715Updated this week
RE-N-Y / imscore
View on GitHub
Minimal Differentiable Image Reward Functions
☆119Mar 30, 2026Updated 3 months ago
sayakpaul / tt-scale-flux
View on GitHub
Inference-time scaling of diffusion-based image and video generation models.
☆174Dec 17, 2025Updated 7 months ago
UnicomAI / LeMiCa
View on GitHub
[NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation
☆122Jun 22, 2026Updated last month
sayakpaul / nanoDiT
View on GitHub
Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.
☆157May 29, 2025Updated last year
MCG-NJU / PixNerd
View on GitHub
[ICLR 2026] PixNerd: Pixel Neural Field Diffusion
☆185Dec 10, 2025Updated 7 months ago
sayakpaul / simple-image-recaptioning
View on GitHub
Recaption large (Web)Datasets with vllm and save the artifacts.
☆53Nov 23, 2024Updated last year
AMD-AGI / Nitro-T
View on GitHub
Nitro-T is a family of text-to-image diffusion models focused on highly efficient training.
☆41Jun 4, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tang-bd / fuse-dit
View on GitHub
[CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
☆140May 16, 2025Updated last year
svg-project / Sparse-VideoGen
View on GitHub
[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention
☆697Jul 4, 2026Updated 3 weeks ago
inclusionAI / TwinFlow
View on GitHub
[ICLR 2026] Taming large-scale few-step training with self-adversarial flows! 👏🏻
☆537Feb 24, 2026Updated 5 months ago
Lakonik / LakonLab
View on GitHub
Official implementation of AsymFlow, pi-Flow, GMFlow
☆455Jul 14, 2026Updated 2 weeks ago
moonmath-ai / LiteAttention
View on GitHub
Transforming Video Diffusion with Temporal Sparse Attention
☆56Apr 8, 2026Updated 3 months ago
SandAI-org / MagiAttention
View on GitHub
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
☆891Updated this week
tianweiy / CausVid
View on GitHub
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
☆1,409Aug 7, 2025Updated 11 months ago