Making Flux go brrr on GPUs.
☆168Jan 5, 2026Updated 4 months ago
Alternatives and similar repositories for flux-fast
Users that are interested in flux-fast are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20.☆24Jul 18, 2025Updated 10 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆396Jan 8, 2026Updated 4 months ago
- 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× …☆109Sep 8, 2025Updated 8 months ago
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching☆427Jul 5, 2025Updated 10 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24Apr 7, 2026Updated last month
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.☆15Aug 25, 2024Updated last year
- pytorch implementation of grok☆11May 18, 2026Updated last week
- Memory-optimized training scripts for video models based on Diffusers☆16Jan 3, 2025Updated last year
- Faster generation with text-to-image diffusion models.☆233Jun 28, 2025Updated 10 months ago
- The Golang-based library for packet manipulation and dissection☆10Mar 10, 2024Updated 2 years ago
- Nitro-T is a family of text-to-image diffusion models focused on highly efficient training.☆41Jul 10, 2025Updated 10 months ago
- ☆13May 11, 2026Updated 2 weeks ago
- EleutherAI ML Performance reading group repository (slides, meeting recordings, annotated papers)☆30Mar 20, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- High-performance safetensors model loader☆143May 19, 2026Updated last week
- Nodes to level up your workflows performance and streamline specific functions.☆10Aug 19, 2025Updated 9 months ago
- Flash Sculptor: Modular 3D Worlds from Objects☆33Apr 13, 2025Updated last year
- Minimal Differentiable Image Reward Functions☆116Mar 30, 2026Updated last month
- faster parallel inference of mochi-1 video generation model☆123Feb 25, 2025Updated last year
- mask2former psg☆22Dec 12, 2022Updated 3 years ago
- ☆14Sep 22, 2025Updated 8 months ago
- The official code for NeurIPS 2025 "MagCache: Fast Video Generation with Magnitude-Aware Cache"☆269Nov 17, 2025Updated 6 months ago
- Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".☆201Apr 13, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.☆995Feb 25, 2026Updated 3 months ago
- Attempt at cog wrapper for a SDXL CLIP Interrogator☆10May 16, 2024Updated 2 years ago
- [NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation☆598Nov 11, 2025Updated 6 months ago
- Minimal repository to demonstrate fast LoRA inference with Flux family of models.☆32Jul 23, 2025Updated 10 months ago
- [CVPR 2026 Highlight] Official implementation of Log-linear Sparse Attention (LLSA).☆77May 1, 2026Updated 3 weeks ago
- Cog wrapper for FalconsAi / nsfw_image_detection☆18Aug 6, 2025Updated 9 months ago
- A unified inference and post-training framework for accelerated video generation.☆3,504Updated this week
- ☆11Oct 6, 2022Updated 3 years ago
- ☆15Mar 22, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation☆45May 28, 2024Updated last year
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆48Jul 17, 2025Updated 10 months ago
- Wan: Open and Advanced Large-Scale Video Generative Models☆29Jul 28, 2025Updated 9 months ago
- Chat with your images using Black Forest Lab's FLUX.1 Kontext☆91Jun 9, 2025Updated 11 months ago
- [ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers☆397Mar 2, 2026Updated 2 months ago
- https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.☆1,303Mar 27, 2025Updated last year
- Distributed parallel 3D-Causal-VAE for efficient training and inference☆47Aug 20, 2025Updated 9 months ago