Qrange-group / SUR-adapter
ACM MM'23 (oral). SUR-adapter lets pre-trained diffusion models acquire the semantic understanding and reasoning capabilities of large language models, building a high-quality textual semantic representation for text-to-image generation.
☆120 Sep 4, 2025 Updated 5 months ago
Alternatives and similar repositories for SUR-adapter
Users who are interested in SUR-adapter are comparing it to the libraries listed below.
- ☆474 Jun 20, 2025 Updated 7 months ago
- Code of ICCV 2023 paper titled General Image-to-Image Translation with One-Shot Image Guidance ☆177 Aug 26, 2023 Updated 2 years ago
- Unofficial implementation of Sketch-Guided Text-to-Image Diffusion Models ☆13 Jun 19, 2023 Updated 2 years ago
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi… ☆482 Sep 9, 2024 Updated last year
- Unofficial implementation of Face0 with SDXL ☆12 Sep 1, 2023 Updated 2 years ago
- Text-To-Image Generation with Chinese Characters ☆132 Jul 20, 2023 Updated 2 years ago
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching ☆168 Nov 18, 2024 Updated last year
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis. ☆79 Mar 24, 2025 Updated 10 months ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation ☆1,635 Oct 29, 2025 Updated 3 months ago
- TerDiT: Ternary Diffusion Models with Transformers ☆74 Jun 17, 2024 Updated last year
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023 ☆294 Jul 14, 2023 Updated 2 years ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023) ☆762 Jan 26, 2024 Updated 2 years ago
- A Stable Diffusion model fine-tuned on a self-translated 10k diffusiondb Chinese corpus and then "extended" ☆32 Mar 29, 2023 Updated 2 years ago
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidance ☆266 Mar 18, 2024 Updated last year
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023) ☆109 Jan 23, 2024 Updated 2 years ago
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor ☆521 Apr 2, 2024 Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance". ☆42 May 24, 2023 Updated 2 years ago
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing ☆23 Aug 23, 2025 Updated 5 months ago
- ☆63 Jul 10, 2025 Updated 7 months ago
- I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models ☆230 Jun 18, 2024 Updated last year
- Code for "Diffusion Model Alignment Using Direct Preference Optimization" ☆661 Nov 10, 2025 Updated 3 months ago
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral) ☆543 Jan 8, 2024 Updated 2 years ago
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model" ☆242 Apr 6, 2024 Updated last year
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning". ☆298 Aug 29, 2025 Updated 5 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation ☆86 Jul 13, 2024 Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models ☆46 Dec 21, 2023 Updated 2 years ago
- A collection of resources on controllable generation with text-to-image diffusion models. ☆1,111 Dec 31, 2024 Updated last year
- Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis' ☆257 Oct 11, 2023 Updated 2 years ago
- AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation ☆44 May 28, 2024 Updated last year
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment ☆1,276 Jul 17, 2024 Updated last year
- Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning ☆314 Jul 11, 2024 Updated last year
- The public source code of "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling" ☆29 Jul 7, 2025 Updated 7 months ago
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders ☆25 Feb 21, 2025 Updated 11 months ago
- [ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation ☆24 Oct 2, 2024 Updated last year
- Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models" ☆269 Dec 10, 2024 Updated last year
- Diffusion-based layout-to-image generation model ☆325 Apr 12, 2025 Updated 10 months ago
- [CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model ☆769 Aug 14, 2024 Updated last year
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023) ☆500 Nov 14, 2023 Updated 2 years ago
- Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models ☆313 Dec 28, 2023 Updated 2 years ago