[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
☆1,843Feb 1, 2025Updated last year
Alternatives and similar repositories for RPG-DiffusionMaster
Users that are interested in RPG-DiffusionMaster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆1,281Jul 17, 2024Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,284Oct 31, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,254Feb 16, 2025Updated last year
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)☆1,896Dec 24, 2024Updated last year
- Official code for "Style Aligned Image Generation via Shared Attention"☆1,316Dec 29, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,153Jan 10, 2025Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,502Jun 28, 2024Updated last year
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,612Jun 14, 2024Updated last year
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,927Jul 18, 2024Updated last year
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆3,006Sep 8, 2024Updated last year
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,651Oct 29, 2025Updated 5 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆2,006Sep 18, 2024Updated last year
- Transparent Image Layer Diffusion using Latent Transparency☆2,195Jun 16, 2024Updated last year
- Official Code for MotionCtrl [SIGGRAPH 2024]☆1,492Feb 19, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official implementation of AnimateDiff.☆12,085Jul 31, 2024Updated last year
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,298Nov 27, 2025Updated 4 months ago
- [ICCV 2023] Consistent Image Synthesis and Editing☆843Aug 19, 2024Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆5,038Jan 9, 2026Updated 2 months ago
- [CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"☆607Jun 17, 2025Updated 9 months ago
- [CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model☆772Aug 14, 2024Updated last year
- Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Con…☆477Oct 21, 2024Updated last year
- T2I-Adapter☆3,805Jun 21, 2024Updated last year
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆481Sep 9, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Character Animation (AnimateAnyone, Face Reenactment)☆3,498May 31, 2024Updated last year
- [CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos…☆978Aug 5, 2024Updated last year
- Your image is almost there!☆7,644Jul 26, 2024Updated last year
- PhotoMaker [CVPR 2024]☆10,121Oct 31, 2024Updated last year
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models☆545Jan 18, 2024Updated 2 years ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆951Nov 13, 2024Updated last year
- [ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!☆839Jan 7, 2026Updated 2 months ago
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,843Mar 7, 2025Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,940Aug 15, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,407Sep 26, 2024Updated last year
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,228Apr 8, 2024Updated last year
- Consistency Distilled Diff VAE☆2,212Nov 7, 2023Updated 2 years ago
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,909Oct 31, 2024Updated last year
- VideoSys: An easy and efficient system for video generation☆2,020Aug 27, 2025Updated 7 months ago
- More relighting!☆8,389Feb 20, 2025Updated last year
- Open-Set Grounded Text-to-Image Generation☆2,214Mar 6, 2024Updated 2 years ago