tsunghan-wu/SLD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tsunghan-wu/SLD)

tsunghan-wu / SLD

🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)

☆187

Alternatives and similar repositories for SLD

Users that are interested in SLD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hananshafi / llmblueprint
View on GitHub
[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"
☆85May 18, 2024Updated 2 years ago
TonyLianLong / LLM-groundedDiffusion
View on GitHub
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…
☆483Sep 9, 2024Updated last year
TonyLianLong / LLM-groundedVideoDiffusion
View on GitHub
[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper
☆172May 7, 2024Updated 2 years ago
xiefan-guo / initno
View on GitHub
[CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
☆80Jun 7, 2024Updated 2 years ago
zhenyuw16 / CompAgent_code
View on GitHub
Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".
☆18Jan 30, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
louisYen / Gen4Gen
View on GitHub
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
☆110Mar 27, 2026Updated 3 months ago
frank-xwang / InstanceDiffusion
View on GitHub
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
☆614Jun 17, 2025Updated last year
YangLing0818 / RPG-DiffusionMaster
View on GitHub
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
☆1,838Feb 1, 2025Updated last year
ali-vilab / Ranni
View on GitHub
☆237Apr 10, 2024Updated 2 years ago
Karine-Huang / T2I-CompBench
View on GitHub
[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
☆346May 7, 2026Updated 2 months ago
omer11a / bounded-attention
View on GitHub
☆96Sep 22, 2024Updated last year
PlusLabNLP / VISCO
View on GitHub
[CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
☆13Jun 7, 2025Updated last year
visual-haystacks / mirage
View on GitHub
🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"
☆27Feb 9, 2025Updated last year
yardenfren1996 / B-LoRA
View on GitHub
Implicit Style-Content Separation using B-LoRA
☆402Nov 14, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
HITsz-TMG / Agentic-CIGEval
View on GitHub
Code of our paper "A Unified Agentic Framework for Evaluating Conditional Image Generation".
☆31Jul 22, 2025Updated last year
TencentQQGYLab / ELLA
View on GitHub
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
☆1,285Jul 17, 2024Updated 2 years ago
silent-chen / layout-guidance
View on GitHub
[WACV 2024] Training-Free Layout Control with Cross-Attention Guidance
☆267Mar 18, 2024Updated 2 years ago
ShihaoZhaoZSH / LaVi-Bridge
View on GitHub
[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
☆300Jul 17, 2024Updated 2 years ago
SPRIGHT-T2I / SPRIGHT
View on GitHub
[ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"
☆105Jul 5, 2024Updated 2 years ago
OpenGVLab / Diffree
View on GitHub
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
☆239May 5, 2025Updated last year
hohonu-vicml / DirectedDiffusion
View on GitHub
Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)
☆82Feb 22, 2024Updated 2 years ago
OrLichter / lcm-lookahead
View on GitHub
☆57Apr 30, 2024Updated 2 years ago
yuval-alaluf / Attend-and-Excite
View on GitHub
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
☆771Jan 26, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
TencentARC / CustomNet
View on GitHub
☆289Jul 22, 2024Updated 2 years ago
Diffusion-CoT / ReflectionFlow
View on GitHub
[ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
☆220Nov 5, 2025Updated 8 months ago
PixArt-alpha / PixArt-alpha
View on GitHub
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,299Oct 31, 2024Updated last year
showlab / X-Adapter
View on GitHub
[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
☆771Aug 14, 2024Updated last year
showlab / BoxDiff
View on GitHub
[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
☆275Nov 12, 2024Updated last year
Zhen-Dong / Magic-Me
View on GitHub
Codes for ID-Specific Video Customized Diffusion
☆460Feb 22, 2024Updated 2 years ago
FrankFundel / SGCond
View on GitHub
☆10Jun 28, 2023Updated 3 years ago
Attention-Refocusing / attention-refocusing
View on GitHub
☆133Jul 17, 2024Updated 2 years ago
magic-research / piecewise-rectified-flow
View on GitHub
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
☆538Sep 8, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sled-group / InfEdit
View on GitHub
[CVPR 2024] Official implementation, Inversion-Free Image Editing with Natural Language"
☆362May 28, 2024Updated 2 years ago
nipunjindal / diffusers-layout-guidance
View on GitHub
🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".
☆42May 24, 2023Updated 3 years ago
IBM / DiffuseKronA
View on GitHub
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models
☆131Sep 18, 2025Updated 10 months ago
Young98CN / LoRA_Composer
View on GitHub
[TIP 2025] LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models
☆65Aug 14, 2024Updated last year
SHI-Labs / T2I-Copilot
View on GitHub
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)
☆56Oct 6, 2025Updated 9 months ago
Qrange-group / SUR-adapter
View on GitHub
ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…
☆120Sep 4, 2025Updated 10 months ago
agwmon / MuDI
View on GitHub
[NeurIPS 2024] MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models
☆96Jan 17, 2025Updated last year