Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.
β504Jun 24, 2025Updated 9 months ago
Alternatives and similar repositories for Awesome-Controllable-Diffusion
Users that are interested in Awesome-Controllable-Diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 πβ3,591May 7, 2025Updated 11 months ago
- A collection of resources on controllable generation with text-to-image diffusion models.β1,113Dec 31, 2024Updated last year
- Diffusion Model-Based Image Editing: A Survey (TPAMI 2025)β710Jul 15, 2025Updated 9 months ago
- β17Aug 8, 2024Updated last year
- collection of diffusion model papers categorized by their subareasβ2,189Mar 16, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ACL 2023] Reasoning with Language Model Prompting: A Surveyβ1,005May 21, 2025Updated 10 months ago
- [CSUR] A Survey on Video Diffusion Modelsβ2,290Updated this week
- A curated list of recent diffusion models for video generation, editing, and various other applications.β5,581Apr 3, 2026Updated 2 weeks ago
- Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Modβ¦β365Mar 19, 2025Updated last year
- Latest Advances on Multimodal Large Language Modelsβ17,624Apr 9, 2026Updated last week
- (ΰ·`κ³Β΄ΰ·) A Survey on Text-to-Image Generation/Synthesis.β2,432Feb 7, 2026Updated 2 months ago
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".β2,104Oct 5, 2023Updated 2 years ago
- A collection of resources and papers on Diffusion Modelsβ12,305Aug 1, 2024Updated last year
- Paper List for In-context Learning π·β873Oct 8, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- LAVIS - A One-stop Library for Language-Vision Intelligenceβ11,195Nov 18, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generationβ2,252Feb 16, 2025Updated last year
- This repository contains a collection of papers and resources on Reasoning in Large Language Models.β569Nov 13, 2023Updated 2 years ago
- Official implementation of "Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance" (NeurIPS 2024)β308Sep 12, 2025Updated 7 months ago
- Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translatiβ¦β281Apr 8, 2026Updated last week
- A Survey of Image Editingβ468Aug 24, 2025Updated 7 months ago
- A library for advanced large language model reasoningβ2,339Jun 10, 2025Updated 10 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalizationβ110Apr 10, 2024Updated 2 years ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"β1,480May 31, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICCV2025] Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".β62Jun 27, 2025Updated 9 months ago
- [ICLR 2025] Official code implementation of DreamBench++: A Human-Aligned Benchmark for Personalized Image Generationβ131Feb 23, 2025Updated last year
- Multimodal-GPTβ1,515Jun 4, 2023Updated 2 years ago
- π₯π₯π₯A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.β166Dec 26, 2024Updated last year
- A reading list of video generationβ698Updated this week
- Research Trends in LLM-guided Multimodal Learning.β356Oct 17, 2023Updated 2 years ago
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Modelsβ134Feb 27, 2025Updated last year
- Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answeringβ¦β31Jan 31, 2023Updated 3 years ago
- Emu Series: Generative Multimodal Models from BAAIβ1,773Jan 12, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ECCV 2024] ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.β546Jan 12, 2025Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.β6,525Jun 28, 2024Updated last year
- β101May 16, 2024Updated last year
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".β1,139Dec 23, 2023Updated 2 years ago
- IP Adapter Instructβ210Aug 10, 2024Updated last year
- Awesome papers on Language-Model-as-a-Service (LMaaS)β545May 14, 2024Updated last year
- π₯π₯π₯ A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).β545Apr 4, 2025Updated last year