OPPO-Mente-Lab/attention-mask-control

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OPPO-Mente-Lab/attention-mask-control)

OPPO-Mente-Lab / attention-mask-control

code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"

☆46

Alternatives and similar repositories for attention-mask-control

Users that are interested in attention-mask-control are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hohonu-vicml / DirectedDiffusion
View on GitHub
Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)
☆82Feb 22, 2024Updated 2 years ago
ExplainableML / ImageSelect
View on GitHub
Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"
☆27Jul 10, 2023Updated 3 years ago
UCSB-NLP-Chang / Diffusion-SpaceTime-Attn
View on GitHub
Official implementation of the paper "Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synth…
☆93Oct 2, 2023Updated 2 years ago
eslambakr / HRS_benchmark
View on GitHub
☆60Oct 13, 2023Updated 2 years ago
OPPO-Mente-Lab / GlyphDraw
View on GitHub
Text-To-Image Generation with Chinese Characters
☆133Jul 20, 2023Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
OPPO-Mente-Lab / Subject-Diffusion
View on GitHub
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
☆317Jul 11, 2024Updated 2 years ago
shape-guided-diffusion / shape-guided-diffusion
View on GitHub
Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024
☆39Aug 19, 2023Updated 2 years ago
OPPO-Mente-Lab / Edit_Everything
View on GitHub
☆92Jul 21, 2023Updated 3 years ago
OPPO-Mente-Lab / TLCM
View on GitHub
Official repo for 【TLCM: Training-efficient Latent Consistency Model for Image Generation with 2-8 Steps】
☆36Dec 27, 2024Updated last year
Nithin-GK / MaxFusion
View on GitHub
[ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models
☆27Nov 2, 2024Updated last year
OPPO-Mente-Lab / PEA-Diffusion
View on GitHub
PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation
☆37Oct 28, 2024Updated last year
evinpinar / Attend-and-Excite-diffusers
View on GitHub
☆13Feb 7, 2023Updated 3 years ago
OPPO-Mente-Lab / FaceScore
View on GitHub
Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】
☆84Dec 26, 2024Updated last year
silent-chen / layout-guidance
View on GitHub
[WACV 2024] Training-Free Layout Control with Cross-Attention Guidance
☆267Mar 18, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OPPO-Mente-Lab / GlyphDraw2
View on GitHub
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
☆87Jul 11, 2024Updated 2 years ago
boschresearch / Divide-and-Bind
View on GitHub
Official implementation of "Divide & Bind Your Attention for Improved Generative Semantic Nursing" (BMVC 2023 Oral)
☆38Jan 25, 2024Updated 2 years ago
univ-esuty / noisecollage
View on GitHub
This is an official repository for the paper, NoiseCollage, which is a revolutionary extension of text-to-image diffusion models for layo…
☆63May 16, 2024Updated 2 years ago
Attention-Refocusing / attention-refocusing
View on GitHub
☆133Jul 17, 2024Updated 2 years ago
Shuweis / ResMaster
View on GitHub
☆63Jun 25, 2024Updated 2 years ago
shunk031 / training-free-structured-diffusion-guidance
View on GitHub
🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…
☆120Mar 29, 2023Updated 3 years ago
nipunjindal / diffusers-layout-guidance
View on GitHub
🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".
☆42May 24, 2023Updated 3 years ago
rohitgandikota / distillation
View on GitHub
Distilling Diversity and Control in Diffusion Models
☆52Apr 28, 2025Updated last year
EnergyAttention / Energy-Based-CrossAttention
View on GitHub
The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".
☆51Apr 1, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
VITA-Group / Neon
View on GitHub
[ICLR 2026 Oral] Neon: Negative Extrapolation From Self-Training Improves Image Generation
☆25Oct 7, 2025Updated 9 months ago
mengab / Joint-Autoregressive-and-Hierarchical-Priors-for-Learned-Image-Compression
View on GitHub
A personal reimplementation with TensorFlow of NIPS2018 paper: Joint Autoregressive and Hierarchical Priors for Learned Image Compression
☆15Jan 17, 2023Updated 3 years ago
yuval-alaluf / Attend-and-Excite
View on GitHub
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
☆771Jan 26, 2024Updated 2 years ago
xuduo35 / UniTune
View on GitHub
Implementation UniTune based on stable diffusion
☆41Nov 15, 2022Updated 3 years ago
dvirsamuel / SeedSelect
View on GitHub
Code for our papers : "Generating images of rare concepts using pre-trained diffusion models" (AAAI 24) and "Norm-guided latent space exp…
☆87Dec 27, 2023Updated 2 years ago
Lenubolim / TextDiff
View on GitHub
Official code implementation of " TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image " in Pattern Recognition
☆25Apr 24, 2024Updated 2 years ago
cvlab-kaist / DiffTrack
View on GitHub
[NeurIPS'25] Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"
☆100Dec 3, 2025Updated 7 months ago
gemlab-vt / CONFORM
View on GitHub
Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models [CVPR 2024]
☆27Oct 7, 2024Updated last year
OPPO-Mente-Lab / Qwen-Image-Pruning
View on GitHub
CVPR 2026 Highlight: Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers
☆86Apr 9, 2026Updated 3 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
hila-chefer / Conceptor
View on GitHub
Official implementation of the paper The Hidden Language of Diffusion Models
☆78Jan 24, 2024Updated 2 years ago
OPPO-Mente-Lab / X2Edit
View on GitHub
AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning
☆97Nov 21, 2025Updated 8 months ago
mengab / SDTS
View on GitHub
TensorFlow implementation of SDTS in IEEE International Conference on Image Processing (ICIP) 2019
☆18May 25, 2019Updated 7 years ago
VincentDENGP / 3D-LR
View on GitHub
Can 3D Vision-Language Models Truly Understand Natural Language?
☆20Mar 28, 2024Updated 2 years ago
qqingzheng / AI-Self-Training-DPO-SDXL
View on GitHub
Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.
☆66Feb 24, 2024Updated 2 years ago
wfanyue / DPG-T2I-Personalization
View on GitHub
[ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning
☆50Jun 17, 2025Updated last year
WisconsinAIVision / visii
View on GitHub
👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023)
☆98Dec 19, 2023Updated 2 years ago