TonyLianLong/LLM-groundedDiffusion

TonyLianLong / LLM-groundedDiffusion

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD, TMLR 2024)

☆483

Alternatives and similar repositories for LLM-groundedDiffusion

Users who are interested in LLM-groundedDiffusion are comparing it to the repositories listed below.

TonyLianLong / LLM-groundedVideoDiffusion
[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper
☆172 · May 7, 2024 · Updated 2 years ago
Attention-Refocusing / attention-refocusing
☆133 · Jul 17, 2024 · Updated 2 years ago
YangLing0818 / RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
☆1,838 · Feb 1, 2025 · Updated last year
tsunghan-wu / SLD
🔥 [CVPR 2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models" (SLD)
☆187 · Apr 9, 2024 · Updated 2 years ago
hananshafi / llmblueprint
[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"
☆85 · May 18, 2024 · Updated 2 years ago
naver-ai / DenseDiffusion
Official PyTorch Implementation of DenseDiffusion (ICCV 2023)
☆508 · Nov 14, 2023 · Updated 2 years ago
silent-chen / layout-guidance
[WACV 2024] Training-Free Layout Control with Cross-Attention Guidance
☆267 · Mar 18, 2024 · Updated 2 years ago
Karine-Huang / T2I-CompBench
[NeurIPS 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-Image Generation Evaluation
☆346 · May 7, 2026 · Updated 2 months ago
showlab / BoxDiff
[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
☆275 · Nov 12, 2024 · Updated last year
gligen / GLIGEN
Open-Set Grounded Text-to-Image Generation
☆2,226 · Mar 6, 2024 · Updated 2 years ago
frank-xwang / InstanceDiffusion
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
☆614 · Jun 17, 2025 · Updated last year
TonyLianLong / igligen
Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation
☆46 · Jun 1, 2024 · Updated 2 years ago
zqiu24 / oft
Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".
☆300 · Aug 29, 2025 · Updated 10 months ago
Qrange-group / SUR-adapter
ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…
☆120 · Sep 4, 2025 · Updated 10 months ago
omerbt / MultiDiffusion
Official PyTorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …
☆1,062 · Sep 21, 2023 · Updated 2 years ago
google / prompt-to-prompt
☆3,457 · May 14, 2024 · Updated 2 years ago
ziqihuangg / ReVersion
[SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images
☆503 · Oct 7, 2025 · Updated 9 months ago
cientgu / InstructDiffusion
PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.
☆445 · May 14, 2024 · Updated 2 years ago
MC-E / DragonDiffusion
ICLR 2024 (Spotlight)
☆788 · Mar 2, 2024 · Updated 2 years ago
PixArt-alpha / PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,299 · Oct 31, 2024 · Updated last year
Zhendong-Wang / Prompt-Diffusion
Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"
☆414 · Mar 25, 2024 · Updated 2 years ago
yuval-alaluf / Attend-and-Excite
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
☆771 · Jan 26, 2024 · Updated 2 years ago
TencentARC / MasaCtrl
[ICCV 2023] Consistent Image Synthesis and Editing
☆843 · Aug 19, 2024 · Updated last year
Picsart-AI-Research / PAIR-Diffusion
[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor
☆521 · Apr 2, 2024 · Updated 2 years ago
thu-ml / unidiffuser
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
☆1,486 · May 31, 2023 · Updated 3 years ago
TencentQQGYLab / ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
☆1,285 · Jul 17, 2024 · Updated 2 years ago
PRIV-Creation / Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
☆1,111 · Dec 31, 2024 · Updated last year
AILab-CVC / FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
☆429 · Aug 25, 2025 · Updated 10 months ago
TencentARC / Mix-of-Show
NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
☆427 · May 14, 2024 · Updated 2 years ago
ShihaoZhaoZSH / LaVi-Bridge
[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
☆300 · Jul 17, 2024 · Updated 2 years ago
j-min / VPGen
Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆57 · Jul 25, 2023 · Updated 2 years ago
showlab / Show-o
[ICLR & NeurIPS 2025] Repository for the Show-o series: One Single Transformer to Unify Multimodal Understanding and Generation.
☆1,963 · Jan 8, 2026 · Updated 6 months ago
AILab-CVC / SEED
Official implementation of SEED-LLaMA (ICLR 2024).
☆642 · Sep 21, 2024 · Updated last year
ali-vilab / Ranni
☆237 · Apr 10, 2024 · Updated 2 years ago
SHI-Labs / Prompt-Free-Diffusion
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arXiv 2023 / CVPR 2024
☆759 · Nov 16, 2023 · Updated 2 years ago
TencentARC / T2I-Adapter
T2I-Adapter
☆3,807 · Jun 21, 2024 · Updated 2 years ago
zhenyuw16 / CompAgent_code
Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".
☆18 · Jan 30, 2024 · Updated 2 years ago
tgxs002 / align_sd
Better Aligning Text-to-Image Models with Human Preference. ICCV 2023
☆293 · Jul 14, 2023 · Updated 3 years ago
showlab / VisorGPT
[NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT
☆138 · May 4, 2024 · Updated 2 years ago