OPPO-Mente-Lab / attention-mask-controlView external linksLinks
code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"
☆46Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for attention-mask-control
Users that are interested in attention-mask-control are comparing it to the libraries listed below
Sorting:
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)☆81Feb 22, 2024Updated last year
- ☆61Oct 13, 2023Updated 2 years ago
- Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024☆38Aug 19, 2023Updated 2 years ago
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection☆19Feb 5, 2026Updated last week
- ☆12Feb 7, 2023Updated 3 years ago
- Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning☆314Jul 11, 2024Updated last year
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidance☆266Mar 18, 2024Updated last year
- The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".☆51Apr 1, 2024Updated last year
- Official implementation of the paper "Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synth…☆93Oct 2, 2023Updated 2 years ago
- This is an official repository for the paper, NoiseCollage, which is a revolutionary extension of text-to-image diffusion models for layo…☆63May 16, 2024Updated last year
- ☆62Jun 25, 2024Updated last year
- ☆132Jul 17, 2024Updated last year
- Text-To-Image Generation with Chinese Characters☆132Jul 20, 2023Updated 2 years ago
- Official implementation of "Divide & Bind Your Attention for Improved Generative Semantic Nursing" (BMVC 2023 Oral)☆37Jan 25, 2024Updated 2 years ago
- Virtual try-on for creating a personal brand wardrobe collection.☆16Aug 15, 2024Updated last year
- Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for…☆16Aug 1, 2025Updated 6 months ago
- ☆93Jul 21, 2023Updated 2 years ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆42May 24, 2023Updated 2 years ago
- [NeurIPS'25] Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"☆95Dec 3, 2025Updated 2 months ago
- Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.☆17May 18, 2023Updated 2 years ago
- ☆24Nov 29, 2023Updated 2 years ago
- Implementation UniTune based on stable diffusion☆40Nov 15, 2022Updated 3 years ago
- Distilling Diversity and Control in Diffusion Models☆50Apr 28, 2025Updated 9 months ago
- ☆24Feb 8, 2025Updated last year
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing☆23Aug 23, 2025Updated 5 months ago
- Official code implementation of " TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image " in Pattern Recognition☆24Apr 24, 2024Updated last year
- Finetune Stable Video Diffusion with Lora☆18Feb 3, 2024Updated 2 years ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…☆120Mar 29, 2023Updated 2 years ago
- Code for our papers : "Generating images of rare concepts using pre-trained diffusion models" (AAAI 24) and "Norm-guided latent space exp…☆86Dec 27, 2023Updated 2 years ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆762Jan 26, 2024Updated 2 years ago
- Official pytorch implementation of "Towards Practical Plug-and-Play Diffusion Models" in CVPR2023☆22Jul 22, 2023Updated 2 years ago
- [NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation☆33Oct 17, 2025Updated 3 months ago
- AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning☆93Nov 21, 2025Updated 2 months ago
- ☆56Apr 30, 2024Updated last year
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆86Jul 11, 2024Updated last year
- Training recipe for SpatialReasoner☆37Sep 21, 2025Updated 4 months ago
- Can 3D Vision-Language Models Truly Understand Natural Language?☆20Mar 28, 2024Updated last year
- 👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023)☆96Dec 19, 2023Updated 2 years ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆27Nov 2, 2024Updated last year