Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.
β506Jun 24, 2025Updated 9 months ago
Alternatives and similar repositories for Awesome-Controllable-Diffusion
Users that are interested in Awesome-Controllable-Diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 πβ3,575May 7, 2025Updated 10 months ago
- A collection of resources on controllable generation with text-to-image diffusion models.β1,113Dec 31, 2024Updated last year
- Diffusion Model-Based Image Editing: A Survey (TPAMI 2025)β708Jul 15, 2025Updated 8 months ago
- β17Aug 8, 2024Updated last year
- collection of diffusion model papers categorized by their subareasβ2,171Mar 16, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- [ACL 2023] Reasoning with Language Model Prompting: A Surveyβ1,001May 21, 2025Updated 10 months ago
- [CSUR] A Survey on Video Diffusion Modelsβ2,281Mar 14, 2026Updated 2 weeks ago
- A curated list of recent diffusion models for video generation, editing, and various other applications.β5,538Mar 14, 2026Updated 2 weeks ago
- Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Modβ¦β365Mar 19, 2025Updated last year
- Latest Advances on Multimodal Large Language Modelsβ17,505Mar 20, 2026Updated last week
- (ΰ·`κ³Β΄ΰ·) A Survey on Text-to-Image Generation/Synthesis.β2,430Feb 7, 2026Updated last month
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".β2,101Oct 5, 2023Updated 2 years ago
- A collection of resources and papers on Diffusion Modelsβ12,301Aug 1, 2024Updated last year
- Paper List for In-context Learning π·β873Oct 8, 2024Updated last year
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- LAVIS - A One-stop Library for Language-Vision Intelligenceβ11,194Nov 18, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generationβ2,254Feb 16, 2025Updated last year
- This repository contains a collection of papers and resources on Reasoning in Large Language Models.β569Nov 13, 2023Updated 2 years ago
- Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translatiβ¦β279Nov 24, 2025Updated 4 months ago
- Official implementation of "Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance" (NeurIPS 2024)β307Sep 12, 2025Updated 6 months ago
- A Survey of Image Editingβ466Aug 24, 2025Updated 7 months ago
- A library for advanced large language model reasoningβ2,339Jun 10, 2025Updated 9 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalizationβ109Apr 10, 2024Updated last year
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"β1,476May 31, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- [ICCV2025] Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".β61Jun 27, 2025Updated 9 months ago
- [ICLR 2025] Official code implementation of DreamBench++: A Human-Aligned Benchmark for Personalized Image Generationβ131Feb 23, 2025Updated last year
- Multimodal-GPTβ1,517Jun 4, 2023Updated 2 years ago
- π₯π₯π₯A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.β166Dec 26, 2024Updated last year
- A reading list of video generationβ687Mar 23, 2026Updated last week
- [SIGGRAPH Asia 2025] Official code for "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models."β120Nov 12, 2025Updated 4 months ago
- Research Trends in LLM-guided Multimodal Learning.β356Oct 17, 2023Updated 2 years ago
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Modelsβ134Feb 27, 2025Updated last year
- Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answeringβ¦β31Jan 31, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Emu Series: Generative Multimodal Models from BAAIβ1,772Jan 12, 2026Updated 2 months ago
- [ECCV 2024] ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.β544Jan 12, 2025Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.β6,502Jun 28, 2024Updated last year
- β101May 16, 2024Updated last year
- IP Adapter Instructβ211Aug 10, 2024Updated last year
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".β1,140Dec 23, 2023Updated 2 years ago
- Awesome papers on Language-Model-as-a-Service (LMaaS)β545May 14, 2024Updated last year