poloclub / diffusion-explainerLinks
Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion
☆380Updated last year
Alternatives and similar repositories for diffusion-explainer
Users that are interested in diffusion-explainer are comparing it to the libraries listed below
Sorting:
- SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.☆537Updated 4 months ago
- Build your own Face App with Stable Diffusion 2.1☆151Updated 7 months ago
- ☆435Updated last year
- Repository for the Paper "Multi-LoRA Composition for Image Generation"☆481Updated last year
- All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https…☆194Updated this week
- ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)☆618Updated 5 months ago
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆395Updated 5 months ago
- Train high-quality text-to-image diffusion models in a data & compute efficient manner☆504Updated 5 months ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆406Updated 6 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆377Updated 3 months ago
- ☆193Updated last year
- Faster generation with text-to-image diffusion models.☆226Updated 2 months ago
- Documentation, notes, links, etc for streams.☆83Updated last year
- Official code for the CVPR 2025 paper "SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models."☆577Updated 3 months ago
- A curated list of awesome resources for FLUX, the state-of-the-art text-to-image model by Black Forest Labs.☆106Updated last year
- ☆204Updated last year
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"☆545Updated last year
- An initiative to replicate Sora☆103Updated last year
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆233Updated last year
- From scratch implementation of a vision language model in pure PyTorch☆235Updated last year
- Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D…☆37Updated 6 months ago
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆193Updated last month
- Contains the public resources of Hands on GenAI book☆189Updated 7 months ago
- (CVPR 2025) Code of "Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models"☆179Updated 5 months ago
- ☆471Updated 2 months ago
- DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework☆349Updated 8 months ago
- [ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image…☆328Updated 2 weeks ago
- documentation for content creation☆219Updated last week
- ☆257Updated 3 months ago
- ☆133Updated last year