huggingface / flux-fast
Making Flux go brrr on GPUs.
☆159 Updated last month
Alternatives and similar repositories for flux-fast
Users who are interested in flux-fast are comparing it to the libraries listed below.
- faster parallel inference of mochi-1 video generation model ☆125 Updated 11 months ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising ☆212 Updated 4 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality ☆258 Updated last year
- Writing FLUX in Triton ☆41 Updated last year
- ☆48 Updated 11 months ago
- (WIP) Parallel inference for black-forest-labs' FLUX model. ☆18 Updated last year
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training). ☆392 Updated 3 weeks ago
- ☆79 Updated last year
- High-throughput tensor loading for PyTorch ☆221 Updated last week
- Inference-time scaling of diffusion-based image and video generation models. ☆172 Updated last month
- Faster generation with text-to-image diffusion models. ☆230 Updated 7 months ago
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices ☆131 Updated 2 months ago
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching ☆418 Updated 6 months ago
- An open-source implementation of Regional Adaptive Sampling (RAS), a novel diffusion model sampling strategy that introduces regional var… ☆150 Updated 7 months ago
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis ☆200 Updated 6 months ago
- [WIP] Better (FP8) attention for Hopper ☆32 Updated 11 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆52 Updated last year
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆47 Updated 6 months ago
- Minimal Differentiable Image Reward Functions ☆106 Updated 5 months ago
- [ICLR'2026] Scale-wise Distillation of Diffusion Models ☆113 Updated 4 months ago
- The official code for NeurIPS 2025 "MagCache: Fast Video Generation with Magnitude-Aware Cache" ☆257 Updated 2 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆213 Updated 4 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think! ☆121 Updated 11 months ago
- Text and image to video generation: Kandinsky 4.0 (2024) ☆149 Updated last year
- 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× … ☆101 Updated 4 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image" ☆313 Updated last year
- DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space ☆342 Updated 4 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models. ☆78 Updated 10 months ago
- Tiny AutoEncoder for Hunyuan Video (and other video models) ☆294 Updated 2 weeks ago
- [NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation ☆575 Updated 2 months ago