apple / ml-mdm
Train high-quality text-to-image diffusion models in a data & compute efficient manner
☆444Updated this week
Related projects ⓘ
Alternatives and complementary repositories for ml-mdm
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆445Updated 4 months ago
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆341Updated 2 months ago
- Repo is required for the code of our research paper on micro-budget training of large scale diffusion model.☆153Updated 4 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆268Updated 2 weeks ago
- Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation☆475Updated 4 months ago
- Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen☆334Updated last month
- Train VAE like a boss☆247Updated last month
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆401Updated last year
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"☆517Updated 10 months ago
- Faster generation with text-to-image diffusion models.☆196Updated last month
- Build your own Face App with Stable Diffusion 2.1☆140Updated last month
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆503Updated 3 months ago
- We're back! Implementations of Meissonic developed by Community~If you feel it is helpful, plz consider giving a star❤️☆252Updated this week
- Official repository for LTX-Video☆379Updated this week
- Rectified Flow Inversion (RF-Inversion)☆269Updated last month
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆362Updated 2 months ago
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models☆528Updated 7 months ago
- On-device Inference of Diffusion Models for Apple Silicon☆510Updated 3 weeks ago
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆209Updated last month
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆447Updated 5 months ago
- Memory optimized finetuning scripts for CogVideoX using TorchAO and DeepSpeed☆421Updated this week
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆346Updated last month
- ☆408Updated 7 months ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆389Updated 2 weeks ago
- A suite of image and video neural tokenizers☆824Updated last week
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group☆127Updated last month
- Code for instruction-tuning Stable Diffusion.☆212Updated 9 months ago
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆367Updated 3 weeks ago
- Official Implementation of weights2weights☆121Updated this week