Making Flux go brrr on GPUs.
☆166 · Jan 5, 2026 · Updated 3 months ago
Alternatives and similar repositories for flux-fast
Users that are interested in flux-fast are comparing it to the libraries listed below.
- A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20. ☆24 · Jul 18, 2025 · Updated 8 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training). ☆397 · Jan 8, 2026 · Updated 3 months ago
- 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× … ☆104 · Sep 8, 2025 · Updated 7 months ago
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching ☆426 · Jul 5, 2025 · Updated 9 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. ☆19 · Feb 9, 2026 · Updated 2 months ago
- ☆23 · Apr 7, 2026 · Updated last week
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX. ☆15 · Aug 25, 2024 · Updated last year
- pytorch implementation of grok ☆12 · Apr 6, 2026 · Updated last week
- Faster generation with text-to-image diffusion models. ☆232 · Jun 28, 2025 · Updated 9 months ago
- Memory-optimized training scripts for video models based on Diffusers ☆16 · Jan 3, 2025 · Updated last year
- Nitro-T is a family of text-to-image diffusion models focused on highly efficient training. ☆40 · Jul 10, 2025 · Updated 9 months ago
- EleutherAI ML Performance reading group repository (slides, meeting recordings, annotated papers) ☆31 · Mar 20, 2026 · Updated 3 weeks ago
- Minimal Differentiable Image Reward Functions ☆110 · Mar 30, 2026 · Updated 2 weeks ago
- Flash Sculptor: Modular 3D Worlds from Objects ☆33 · Apr 13, 2025 · Updated last year
- mask2former psg ☆22 · Dec 12, 2022 · Updated 3 years ago
- faster parallel inference of mochi-1 video generation model ☆125 · Feb 25, 2025 · Updated last year
- ☆25 · Sep 19, 2025 · Updated 6 months ago
- ☆32 · Jan 30, 2023 · Updated 3 years ago
- ☆14 · Sep 22, 2025 · Updated 6 months ago
- The official code for NeurIPS 2025 "MagCache: Fast Video Generation with Magnitude-Aware Cache" ☆265 · Nov 17, 2025 · Updated 5 months ago
- Official implementation of Log-linear Sparse Attention (LLSA). ☆70 · Feb 2, 2026 · Updated 2 months ago
- Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling". ☆200 · Apr 13, 2025 · Updated last year
- [ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference. ☆973 · Feb 25, 2026 · Updated last month
- Attempt at cog wrapper for a SDXL CLIP Interrogator ☆10 · May 16, 2024 · Updated last year
- [NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation ☆589 · Nov 11, 2025 · Updated 5 months ago
- Minimal repository to demonstrate fast LoRA inference with Flux family of models. ☆31 · Jul 23, 2025 · Updated 8 months ago
- Cog wrapper for FalconsAi / nsfw_image_detection ☆18 · Aug 6, 2025 · Updated 8 months ago
- A unified inference and post-training framework for accelerated video generation. ☆3,365 · Apr 10, 2026 · Updated last week
- ☆11 · Oct 6, 2022 · Updated 3 years ago
- ☆14 · Mar 22, 2025 · Updated last year
- AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation ☆44 · May 28, 2024 · Updated last year
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆48 · Jul 17, 2025 · Updated 8 months ago
- Wan: Open and Advanced Large-Scale Video Generative Models ☆29 · Jul 28, 2025 · Updated 8 months ago
- [ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers ☆388 · Mar 2, 2026 · Updated last month
- https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs. ☆1,306 · Mar 27, 2025 · Updated last year
- Chat with your images using Black Forest Lab's FLUX.1 Kontext ☆90 · Jun 9, 2025 · Updated 10 months ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate. ☆835 · Updated this week
- Distributed parallel 3D-Causal-VAE for efficient training and inference ☆47 · Aug 20, 2025 · Updated 7 months ago
- A PyTorch-native inference engine with cache acceleration, parallelism and quantization for DiTs. ☆1,130 · Updated this week