apple / ml-mdmLinks
Train high-quality text-to-image diffusion models in a data & compute efficient manner
☆515Updated 10 months ago
Alternatives and similar repositories for ml-mdm
Users that are interested in ml-mdm are comparing it to the libraries listed below
Sorting:
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆403Updated 10 months ago
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆631Updated last year
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆435Updated 2 years ago
- ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)☆651Updated 10 months ago
- Train VAE like a boss☆313Updated last year
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"☆561Updated 2 years ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆392Updated last month
- Official implementation of Inductive Moment Matching☆570Updated 6 months ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆415Updated 11 months ago
- Faster generation with text-to-image diffusion models.☆230Updated 7 months ago
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models☆556Updated last year
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆534Updated 5 months ago
- ☆142Updated last year
- Build your own Face App with Stable Diffusion 2.1☆154Updated last year
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆200Updated 6 months ago
- Rectified Flow Inversion (RF-Inversion) - ICLR 2025☆469Updated 10 months ago
- Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen☆433Updated 11 months ago
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆284Updated last year
- Inference-time scaling of diffusion-based image and video generation models.☆172Updated last month
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch☆281Updated last year
- ☆441Updated last year
- Official implementation for "Stable Flow: Vital Layers for Training-Free Image Editing" [CVPR 2025]☆406Updated 8 months ago
- Code for instruction-tuning Stable Diffusion.☆248Updated last year
- ☆282Updated last year
- Implementation of Key-Locked Rank One Editing, from Nvidia AI☆237Updated 2 years ago
- Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"☆394Updated last year
- Official Implementation of weights2weights☆154Updated 11 months ago
- Tiny AutoEncoder for Stable Diffusion (and other image models)☆877Updated 2 weeks ago
- Official implementation of OneDiffusion paper (CVPR 2025)☆663Updated last year
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.☆158Updated 2 years ago