apple / ml-mdm
Train high-quality text-to-image diffusion models in a data & compute efficient manner
☆491Updated last month
Alternatives and similar repositories for ml-mdm:
Users that are interested in ml-mdm are comparing it to the libraries listed below
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆517Updated 10 months ago
- Train VAE like a boss☆276Updated 6 months ago
- Official implementation of Inductive Moment Matching☆458Updated last month
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆379Updated last month
- ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)☆584Updated last month
- Faster generation with text-to-image diffusion models.☆213Updated 7 months ago
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"☆539Updated last year
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆506Updated 11 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆420Updated last year
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆346Updated 2 months ago
- Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen☆383Updated 2 months ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆395Updated 2 months ago
- SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.☆513Updated last month
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆590Updated last month
- Official implementation of OneDiffusion paper (CVPR 2025)☆625Updated 4 months ago
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆351Updated 3 months ago
- [ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image…☆307Updated last month
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆178Updated last month
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆410Updated 5 months ago
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch☆274Updated 9 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆575Updated 6 months ago
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models☆544Updated last year
- Memory-optimized training library for diffusion models☆1,120Updated this week
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.☆1,275Updated 2 weeks ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,735Updated 8 months ago
- ☆488Updated 5 months ago
- Inference-time scaling of diffusion-based image and video generation models.☆142Updated 2 months ago
- (NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis☆769Updated 2 months ago
- Tiny AutoEncoder for Stable Diffusion☆710Updated 2 weeks ago
- Official Implementation of weights2weights☆141Updated 2 months ago