code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"
☆46Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for attention-mask-control
Users that are interested in attention-mask-control are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)☆81Feb 22, 2024Updated 2 years ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Jul 10, 2023Updated 2 years ago
- Official implementation of the paper "Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synth…☆93Oct 2, 2023Updated 2 years ago
- ☆61Oct 13, 2023Updated 2 years ago
- Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning☆316Jul 11, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Text-To-Image Generation with Chinese Characters☆134Jul 20, 2023Updated 2 years ago
- Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024☆38Aug 19, 2023Updated 2 years ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆27Nov 2, 2024Updated last year
- Official repo for 【TLCM: Training-efficient Latent Consistency Model for Image Generation with 2-8 Steps】☆37Dec 27, 2024Updated last year
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation☆37Oct 28, 2024Updated last year
- CVPR2026 Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers☆71Mar 12, 2026Updated 2 weeks ago
- ☆13Feb 7, 2023Updated 3 years ago
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidance☆266Mar 18, 2024Updated 2 years ago
- Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】☆84Dec 26, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning☆96Nov 21, 2025Updated 4 months ago
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆88Jul 11, 2024Updated last year
- Official implementation of "Divide & Bind Your Attention for Improved Generative Semantic Nursing" (BMVC 2023 Oral)☆37Jan 25, 2024Updated 2 years ago
- This is an official repository for the paper, NoiseCollage, which is a revolutionary extension of text-to-image diffusion models for layo…☆63May 16, 2024Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…☆120Mar 29, 2023Updated 3 years ago
- ☆133Jul 17, 2024Updated last year
- ☆63Jun 25, 2024Updated last year
- Distilling Diversity and Control in Diffusion Models☆52Apr 28, 2025Updated 11 months ago
- The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".☆51Apr 1, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A personal reimplementation with TensorFlow of NIPS2018 paper: Joint Autoregressive and Hierarchical Priors for Learned Image Compression☆15Jan 17, 2023Updated 3 years ago
- ☆127Mar 19, 2024Updated 2 years ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆768Jan 26, 2024Updated 2 years ago
- Implementation UniTune based on stable diffusion☆40Nov 15, 2022Updated 3 years ago
- [ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…☆21Oct 24, 2024Updated last year
- Official implementation of the paper The Hidden Language of Diffusion Models☆77Jan 24, 2024Updated 2 years ago
- Official code implementation of " TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image " in Pattern Recognition☆24Apr 24, 2024Updated last year
- ☆10Jun 28, 2023Updated 2 years ago
- [NeurIPS'25] Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"☆97Dec 3, 2025Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- TensorFlow implementation of SDTS in IEEE International Conference on Image Processing (ICIP) 2019☆18May 25, 2019Updated 6 years ago
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing☆23Aug 23, 2025Updated 7 months ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆131May 16, 2025Updated 10 months ago
- Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.☆17May 18, 2023Updated 2 years ago
- Can 3D Vision-Language Models Truly Understand Natural Language?☆20Mar 28, 2024Updated 2 years ago
- 👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023)☆96Dec 19, 2023Updated 2 years ago
- AlphaFace: High Fidelity and Real-time Face Swapper Robust to Facial Pose☆41Jan 23, 2026Updated 2 months ago