cientgu/InstructDiffusion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cientgu/InstructDiffusion)

cientgu / InstructDiffusion

PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.

☆445

Alternatives and similar repositories for InstructDiffusion

Users that are interested in InstructDiffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TencentARC / SmartEdit
View on GitHub
Official code of SmartEdit [CVPR-2024 Highlight]
☆374Jun 21, 2024Updated 2 years ago
OSU-NLP-Group / MagicBrush
View on GitHub
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
☆411Feb 20, 2025Updated last year
Zhendong-Wang / Prompt-Diffusion
View on GitHub
Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"
☆414Mar 25, 2024Updated 2 years ago
TencentARC / MasaCtrl
View on GitHub
[ICCV 2023] Consistent Image Synthesis and Editing
☆843Aug 19, 2024Updated last year
salesforce / HIVE
View on GitHub
☆121Jun 2, 2026Updated last month
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
naver-ai / DenseDiffusion
View on GitHub
Official Pytorch Implementation of DenseDiffusion (ICCV 2023)
☆507Nov 14, 2023Updated 2 years ago
sled-group / InfEdit
View on GitHub
[CVPR 2024] Official implementation, Inversion-Free Image Editing with Natural Language"
☆362May 28, 2024Updated 2 years ago
ziqihuangg / ReVersion
View on GitHub
[SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images
☆504Oct 7, 2025Updated 9 months ago
QianWangX / InstructEdit
View on GitHub
Implementation of InstructEdit
☆75Oct 30, 2023Updated 2 years ago
SHI-Labs / Prompt-Free-Diffusion
View on GitHub
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024
☆759Nov 16, 2023Updated 2 years ago
Picsart-AI-Research / PAIR-Diffusion
View on GitHub
[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor
☆521Apr 2, 2024Updated 2 years ago
timothybrooks / instruct-pix2pix
View on GitHub
☆6,884Mar 3, 2024Updated 2 years ago
frank-xwang / InstanceDiffusion
View on GitHub
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
☆614Jun 17, 2025Updated last year
NeuralTextualInversion / NeTI
View on GitHub
Official Implementation for "A Neural Space-Time Representation for Text-to-Image Personalization" (SIGGRAPH Asia 2023)
☆182Sep 19, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Fantasy-Studio / Paint-by-Example
View on GitHub
Paint by Example: Exemplar-based Image Editing with Diffusion Models
☆1,252Nov 28, 2023Updated 2 years ago
FoundationVision / LlamaGen
View on GitHub
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,959Aug 15, 2024Updated last year
OPPO-Mente-Lab / Subject-Diffusion
View on GitHub
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
☆317Jul 11, 2024Updated 2 years ago
google / prompt-to-prompt
View on GitHub
☆3,456May 14, 2024Updated 2 years ago
MC-E / DragonDiffusion
View on GitHub
ICLR 2024 (Spotlight)
☆788Mar 2, 2024Updated 2 years ago
YingqingHe / ScaleCrafter
View on GitHub
[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
☆507Mar 7, 2024Updated 2 years ago
kfirgoldberg / ConceptLab
View on GitHub
Official Implementation for "ConceptLab: Creative Generation using Diffusion Prior Constraints"
☆256Dec 19, 2023Updated 2 years ago
RunpeiDong / DreamLLM
View on GitHub
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
☆462Dec 2, 2024Updated last year
google / break-a-scene
View on GitHub
Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]
☆525Jan 14, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
genforce / freecontrol
View on GitHub
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Con…
☆480Oct 21, 2024Updated last year
guoqincode / Focus-on-Your-Instruction
View on GitHub
[CVPR 2024] Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
☆116Mar 22, 2024Updated 2 years ago
NVlabs / ODISE
View on GitHub
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
☆945Jul 6, 2024Updated 2 years ago
ShihaoZhaoZSH / Uni-ControlNet
View on GitHub
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
☆670Jul 17, 2024Updated 2 years ago
showlab / BoxDiff
View on GitHub
[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
☆275Nov 12, 2024Updated last year
abyildirim / inst-inpaint
View on GitHub
A novel inpainting framework that can remove objects from images based on the instructions given as text prompts.
☆386Dec 9, 2025Updated 7 months ago
gnobitab / InstaFlow
View on GitHub
InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
☆1,408Jun 7, 2024Updated 2 years ago
Tsingularity / dift
View on GitHub
[NeurIPS'23] Emergent Correspondence from Image Diffusion
☆773May 14, 2024Updated 2 years ago
yuval-alaluf / Attend-and-Excite
View on GitHub
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
☆772Jan 26, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
PRIV-Creation / Awesome-Controllable-T2I-Diffusion-Models
View on GitHub
A collection of resources on controllable generation with text-to-image diffusion models.
☆1,111Dec 31, 2024Updated last year
OpenGVLab / VisionLLM
View on GitHub
VisionLLM Series
☆1,152Feb 27, 2025Updated last year
YangLing0818 / EditWorld
View on GitHub
[ACM Multimedia 2025 Datasets Track] EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
☆141Aug 2, 2025Updated 11 months ago
PixArt-alpha / PixArt-alpha
View on GitHub
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,298Oct 31, 2024Updated last year
AILab-CVC / SEED
View on GitHub
Official implementation of SEED-LLaMA (ICLR 2024).
☆642Sep 21, 2024Updated last year
Junyi42 / sd-dino
View on GitHub
Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"
☆356Mar 29, 2024Updated 2 years ago
showlab / Show-o
View on GitHub
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
☆1,962Jan 8, 2026Updated 6 months ago