Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.
β505Jun 24, 2025Updated 11 months ago
Alternatives and similar repositories for Awesome-Controllable-Diffusion
Users that are interested in Awesome-Controllable-Diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 πβ3,622Apr 20, 2026Updated last month
- A collection of resources on controllable generation with text-to-image diffusion models.β1,115Dec 31, 2024Updated last year
- Diffusion Model-Based Image Editing: A Survey (TPAMI 2025)β712Jul 15, 2025Updated 10 months ago
- β17Aug 8, 2024Updated last year
- collection of diffusion model papers categorized by their subareasβ2,206Mar 16, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ACL 2023] Reasoning with Language Model Prompting: A Surveyβ1,007May 21, 2025Updated last year
- [CSUR] A Survey on Video Diffusion Modelsβ2,293Apr 15, 2026Updated last month
- A curated list of recent diffusion models for video generation, editing, and various other applications.β5,663May 8, 2026Updated 3 weeks ago
- Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Modβ¦β372Mar 19, 2025Updated last year
- Latest Advances on Multimodal Large Language Modelsβ17,829May 1, 2026Updated 3 weeks ago
- (ΰ·`κ³Β΄ΰ·) A Survey on Text-to-Image Generation/Synthesis.β2,435Feb 7, 2026Updated 3 months ago
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".β2,104Oct 5, 2023Updated 2 years ago
- A collection of resources and papers on Diffusion Modelsβ12,328Aug 1, 2024Updated last year
- Paper List for In-context Learning π·β876Oct 8, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- LAVIS - A One-stop Library for Language-Vision Intelligenceβ11,224Nov 18, 2024Updated last year
- This repository contains a collection of papers and resources on Reasoning in Large Language Models.β569Nov 13, 2023Updated 2 years ago
- Lumina-T2X is a unified framework for Text to Any Modality Generationβ2,252Feb 16, 2025Updated last year
- Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translatiβ¦β287Apr 8, 2026Updated last month
- Official implementation of "Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance" (NeurIPS 2024)β307Sep 12, 2025Updated 8 months ago
- A Survey of Image Editingβ472Aug 24, 2025Updated 9 months ago
- A library for advanced large language model reasoningβ2,343Jun 10, 2025Updated 11 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalizationβ110Apr 10, 2024Updated 2 years ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"β1,483May 31, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICCV2025] Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".β62Jun 27, 2025Updated 11 months ago
- [ICLR 2025] Official code implementation of DreamBench++: A Human-Aligned Benchmark for Personalized Image Generationβ135Feb 23, 2025Updated last year
- π₯π₯π₯A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.β169Dec 26, 2024Updated last year
- Multimodal-GPTβ1,515Jun 4, 2023Updated 2 years ago
- A reading list of video generationβ713May 8, 2026Updated 3 weeks ago
- Research Trends in LLM-guided Multimodal Learning.β356Oct 17, 2023Updated 2 years ago
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Modelsβ135Feb 27, 2025Updated last year
- Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answeringβ¦β31Jan 31, 2023Updated 3 years ago
- Emu Series: Generative Multimodal Models from BAAIβ1,775Jan 12, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ECCV 2024] ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.β547Jan 12, 2025Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.β6,584Jun 28, 2024Updated last year
- β101May 16, 2024Updated 2 years ago
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".β1,138Dec 23, 2023Updated 2 years ago
- IP Adapter Instructβ212Aug 10, 2024Updated last year
- Awesome papers on Language-Model-as-a-Service (LMaaS)β545May 14, 2024Updated 2 years ago
- Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Conβ¦β480Oct 21, 2024Updated last year