LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD, TMLR 2024)
☆480 Sep 9, 2024 Updated last year
Alternatives and similar repositories for LLM-groundedDiffusion
Users that are interested in LLM-groundedDiffusion are comparing it to the libraries listed below.
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper ☆168 May 7, 2024 Updated last year
- ☆133 Jul 17, 2024 Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,843 Feb 1, 2025 Updated last year
- 🔥 [CVPR 2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models" (SLD) ☆187 Apr 9, 2024 Updated last year
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts" ☆84 May 18, 2024 Updated last year
- Official PyTorch Implementation of DenseDiffusion (ICCV 2023) ☆502 Nov 14, 2023 Updated 2 years ago
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidance ☆266 Mar 18, 2024 Updated 2 years ago
- [NeurIPS 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation ☆334 Dec 24, 2025 Updated 2 months ago
- [ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion ☆275 Nov 12, 2024 Updated last year
- Open-Set Grounded Text-to-Image Generation ☆2,212 Mar 6, 2024 Updated 2 years ago
- [CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation" ☆607 Jun 17, 2025 Updated 9 months ago
- Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation ☆46 Jun 1, 2024 Updated last year
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning". ☆298 Aug 29, 2025 Updated 6 months ago
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities… ☆120 Sep 4, 2025 Updated 6 months ago
- Official PyTorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" … ☆1,057 Sep 21, 2023 Updated 2 years ago
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images ☆504 Oct 7, 2025 Updated 5 months ago
- ☆3,444 May 14, 2024 Updated last year
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions. ☆443 May 14, 2024 Updated last year
- ICLR 2024 (Spotlight) ☆786 Mar 2, 2024 Updated 2 years ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆3,281 Oct 31, 2024 Updated last year
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models" ☆414 Mar 25, 2024 Updated last year
- [ICCV 2023] Consistent Image Synthesis and Editing ☆843 Aug 19, 2024 Updated last year
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor ☆522 Apr 2, 2024 Updated last year
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment ☆1,281 Jul 17, 2024 Updated last year
- A collection of resources on controllable generation with text-to-image diffusion models. ☆1,113 Dec 31, 2024 Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023) ☆767 Jan 26, 2024 Updated 2 years ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,476 May 31, 2023 Updated 2 years ago
- NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models ☆428 May 14, 2024 Updated last year
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation ☆299 Jul 17, 2024 Updated last year
- [ICLR 2024] Code for FreeNoise based on VideoCrafter ☆428 Aug 25, 2025 Updated 6 months ago
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ☆57 Jul 25, 2023 Updated 2 years ago
- Official implementation of SEED-LLaMA (ICLR 2024). ☆642 Sep 21, 2024 Updated last year
- ☆238 Apr 10, 2024 Updated last year
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arXiv 2023 / CVPR 2024 ☆759 Nov 16, 2023 Updated 2 years ago
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation. ☆1,895 Jan 8, 2026 Updated 2 months ago
- T2I-Adapter ☆3,803 Jun 21, 2024 Updated last year
- Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation". ☆18 Jan 30, 2024 Updated 2 years ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023 ☆294 Jul 14, 2023 Updated 2 years ago
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models ☆556 Apr 6, 2024 Updated last year