apple / ml-mdm
Train high-quality text-to-image diffusion models in a data & compute efficient manner
☆483Updated last month
Alternatives and similar repositories for ml-mdm:
Users that are interested in ml-mdm are comparing it to the libraries listed below
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆491Updated 8 months ago
- Train VAE like a boss☆270Updated 5 months ago
- ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)☆566Updated 2 weeks ago
- Official implementation of Inductive Moment Matching☆413Updated 2 weeks ago
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆372Updated last week
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆388Updated 3 weeks ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆413Updated last year
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆334Updated last month
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆328Updated 2 months ago
- (NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis☆720Updated 3 weeks ago
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"☆534Updated last year
- Faster generation with text-to-image diffusion models.☆211Updated 5 months ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆548Updated 7 months ago
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆492Updated 9 months ago
- Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen☆378Updated 2 weeks ago
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆397Updated this week
- Official implementation of OneDiffusion paper (CVPR 2025)☆617Updated 3 months ago
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models☆541Updated 11 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,623Updated 7 months ago
- ☆426Updated 11 months ago
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.☆1,245Updated 4 months ago
- Inference-time scaling of diffusion-based image and video generation models.☆117Updated 3 weeks ago
- SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.☆492Updated last month
- Rectified Flow Inversion (RF-Inversion) - ICLR 2025☆370Updated last week
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆168Updated this week
- ☆444Updated 3 months ago
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,361Updated 2 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆441Updated 5 months ago
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch☆269Updated 7 months ago
- Code for instruction-tuning Stable Diffusion.☆223Updated last year