apple / ml-mdmLinks
Train high-quality text-to-image diffusion models in a data & compute efficient manner
☆509Updated 6 months ago
Alternatives and similar repositories for ml-mdm
Users that are interested in ml-mdm are comparing it to the libraries listed below
Sorting:
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆398Updated 7 months ago
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆611Updated last year
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆427Updated 2 years ago
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"☆549Updated last year
- Faster generation with text-to-image diffusion models.☆228Updated 3 months ago
- ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)☆629Updated 7 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆381Updated 4 months ago
- Official implementation of Inductive Moment Matching☆557Updated 3 months ago
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models☆552Updated last year
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆406Updated 7 months ago
- Train VAE like a boss☆295Updated last year
- Build your own Face App with Stable Diffusion 2.1☆152Updated 9 months ago
- ☆436Updated last year
- Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen☆426Updated 7 months ago
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆195Updated 3 months ago
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆281Updated last year
- ☆137Updated last year
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆527Updated last month
- Inference-time scaling of diffusion-based image and video generation models.☆169Updated 3 months ago
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch☆280Updated last year
- Official implementation for "Stable Flow: Vital Layers for Training-Free Image Editing" [CVPR 2025]☆392Updated 4 months ago
- A Gradio demo of MGIE☆346Updated last year
- Rectified Flow Inversion (RF-Inversion) - ICLR 2025☆459Updated 7 months ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.☆157Updated last year
- Implementation of Key-Locked Rank One Editing, from Nvidia AI☆235Updated 2 years ago
- Code for instruction-tuning Stable Diffusion.☆241Updated last year
- ☆194Updated last year
- Official implementation of OneDiffusion paper (CVPR 2025)☆650Updated 10 months ago
- Tiny AutoEncoder for Stable Diffusion☆793Updated 6 months ago
- ☆282Updated 9 months ago