Qrange-group/SUR-adapter

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Qrange-group/SUR-adapter)

Qrange-group / SUR-adapter

ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities from large language models to build a high-quality textual semantic representation for text-to-image generation.

☆120

Alternatives and similar repositories for SUR-adapter

Users that are interested in SUR-adapter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TonyLianLong / LLM-groundedDiffusion
View on GitHub
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…
☆482Sep 9, 2024Updated last year
vislearn / ControlNet-XS
View on GitHub
☆472Jun 20, 2025Updated last year
OPPO-Mente-Lab / GlyphDraw
View on GitHub
Text-To-Image Generation with Chinese Characters
☆133Jul 20, 2023Updated 3 years ago
sangminkim-99 / Sketch-Guided-Text-To-Image
View on GitHub
Unofficial implementation of Sketch-Guided Text-to-Image Diffusion Models
☆13Jun 19, 2023Updated 3 years ago
CrystalNeuro / visual-concept-translator
View on GitHub
Code of ICCV 2023 paper titled General Image-to-Image Translation with One-Shot Image Guidance
☆173Aug 26, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
CaraJ7 / CoMat
View on GitHub
[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
☆169Nov 18, 2024Updated last year
sunyuan-cs / 2024-TKDE-RMCNC
View on GitHub
About PyTorch implementation for ‘’Robust Multi-View Clustering with Noisy Correspondence‘’ (TKDE 2024)
☆11Aug 2, 2024Updated last year
Dshijie / DMVG
View on GitHub
Diffusion-based Missing-view Generation With the Application on Incomplete Multi-view Clustering
☆10May 26, 2024Updated 2 years ago
Guanzhou-Ke / MRDD
View on GitHub
The official repos of "Rethinking Multi-view Representation Learning via Distilled Disentangling"
☆14Apr 3, 2024Updated 2 years ago
tgxs002 / align_sd
View on GitHub
Better Aligning Text-to-Image Models with Human Preference. ICCV 2023
☆293Jul 14, 2023Updated 3 years ago
CodeGoat24 / DreamText
View on GitHub
[CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.
☆82Mar 24, 2025Updated last year
bryandlee / face0-sdxl
View on GitHub
Unofficial implementation of Face0 with SDXL
☆12Sep 1, 2023Updated 2 years ago
zai-org / ImageReward
View on GitHub
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
☆1,694Oct 29, 2025Updated 8 months ago
SalesforceAIResearch / DiffusionDPO
View on GitHub
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
☆704Jun 2, 2026Updated last month
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Pengchengpcx / FTEdit
View on GitHub
[CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
☆26Aug 23, 2025Updated 10 months ago
yuval-alaluf / Attend-and-Excite
View on GitHub
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
☆771Jan 26, 2024Updated 2 years ago
silent-chen / layout-guidance
View on GitHub
[WACV 2024] Training-Free Layout Control with Cross-Attention Guidance
☆267Mar 18, 2024Updated 2 years ago
wangemm / AGCL-TMM-2022
View on GitHub
Code of Graph Contrastive Partial Multi-View Clustering
☆12Mar 10, 2025Updated last year
li-shuxian / TME
View on GitHub
[CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval".
☆27Jun 9, 2025Updated last year
ZhangqiJiang07 / code_DIMvLN
View on GitHub
[AAAI 2024] PyTorch implementation for Deep Incomplete Multi-View Learning Network with Insufficient Label Information
☆16Mar 17, 2025Updated last year
ygtxr1997 / CelebBasis
View on GitHub
Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis'
☆255Oct 11, 2023Updated 2 years ago
csyxwei / ELITE
View on GitHub
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)
☆541Jan 8, 2024Updated 2 years ago
yk7333 / d3po
View on GitHub
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
☆244Apr 6, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
calisolo / Levels_image_captioning_NICE
View on GitHub
NICE challenge 2023 Track2 2nd result(total 4th) (CVPR 2023) sponsered by LG AI/Shutterstock/SNU
☆11Jun 22, 2023Updated 3 years ago
duyguceylan / pix2video
View on GitHub
Code for the paper "Pix2Video: Video Editing using Image Diffusion"
☆77Oct 2, 2023Updated 2 years ago
boyuh / AUCSeg
View on GitHub
This repository is the official code for the paper "AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation" (NeurIPS 2024).
☆14Sep 17, 2025Updated 10 months ago
salesforce / HIVE
View on GitHub
☆121Jun 2, 2026Updated last month
Picsart-AI-Research / PAIR-Diffusion
View on GitHub
[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor
☆521Apr 2, 2024Updated 2 years ago
hananshafi / llmblueprint
View on GitHub
[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"
☆85May 18, 2024Updated 2 years ago
jacklishufan / InstructAny2Pix
View on GitHub
PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following
☆33Jan 24, 2025Updated last year
Attention-Refocusing / attention-refocusing
View on GitHub
☆133Jul 17, 2024Updated 2 years ago
PRIV-Creation / Awesome-Controllable-T2I-Diffusion-Models
View on GitHub
A collection of resources on controllable generation with text-to-image diffusion models.
☆1,111Dec 31, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
MIGHTYEZ / Inversion-DPO
View on GitHub
☆19Jul 22, 2025Updated 11 months ago
5Martina5 / FMCSC
View on GitHub
Official code and datas for "Bridging Gaps: Federated Multi-View Clustering in Heterogeneous Hybrid Views". (NeurIPS 2024)
☆17Oct 13, 2024Updated last year
Mowenyii / PAE
View on GitHub
[CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation
☆87Jul 13, 2024Updated 2 years ago
huggingface / amused
View on GitHub
☆89Jan 4, 2024Updated 2 years ago
zwl666666 / infusion
View on GitHub
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting
☆14Dec 19, 2025Updated 7 months ago
ZichengDuan / TheChosenOne
View on GitHub
Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"
☆270Dec 10, 2024Updated last year
Sealical / anywhere-multi-agent
View on GitHub
AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation
☆46May 28, 2024Updated 2 years ago