huggingface / diffusion-fast
Faster generation with text-to-image diffusion models.
☆229 Updated 4 months ago
Alternatives and similar repositories for diffusion-fast
Users who are interested in diffusion-fast are comparing it to the libraries listed below.
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training). ☆380 Updated 5 months ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free! ☆408 Updated 8 months ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising ☆206 Updated last month
- ☆137 Updated last year
- Official Repository of the paper "Trajectory Consistency Distillation" ☆355 Updated last year
- Making Flux go brrr on GPUs. ☆153 Updated 3 months ago
- ☆438 Updated last year
- Diffusion Reinforcement Learning Library ☆190 Updated last year
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast… ☆285 Updated last year
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control" ☆401 Updated 7 months ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆157 Updated 2 years ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable ☆430 Updated 2 years ago
- Generate long weighted prompt embeddings for Stable Diffusion ☆144 Updated 6 months ago
- ☆126 Updated 8 months ago
- ☆49 Updated last year
- ☆51 Updated 2 years ago
- Code for instruction-tuning Stable Diffusion. ☆244 Updated last year
- Implementation of Key-Locked Rank One Editing, from Nvidia AI ☆236 Updated 2 years ago
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL. ☆177 Updated last year
- Simple large-scale training of stable diffusion with multi-node support. ☆133 Updated 2 years ago
- ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral) ☆636 Updated 8 months ago
- Iterable datapipelines for pytorch training. ☆87 Updated last year
- Writing FLUX in Triton ☆41 Updated last year
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image" ☆309 Updated 10 months ago
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models ☆553 Updated last year
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs" ☆552 Updated last year
- faster parallel inference of mochi-1 video generation model ☆125 Updated 8 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality ☆251 Updated 10 months ago
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching ☆388 Updated 4 months ago
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models ☆713 Updated 11 months ago