Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)
☆26May 23, 2024Updated last year
Alternatives and similar repositories for BoxDiff-XL
Users that are interested in BoxDiff-XL are comparing it to the libraries listed below
Sorting:
- [ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion☆275Nov 12, 2024Updated last year
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆137May 4, 2024Updated last year
- Improving word mover’s distance by leveraging self-attention matrix (Published in EMNLP 2023 Findings)☆10Jun 17, 2025Updated 8 months ago
- DDS: Delta Denoising Score PyTorch implementation☆19Sep 2, 2023Updated 2 years ago
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models☆120Nov 14, 2024Updated last year
- Streaming Video Diffusion: Online Video Editing with Diffusion Models☆18Jun 3, 2024Updated last year
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidance☆266Mar 18, 2024Updated last year
- ☆25Nov 30, 2023Updated 2 years ago
- Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation☆46Jun 1, 2024Updated last year
- [ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation☆24Oct 2, 2024Updated last year
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- A SDXL compatible T2I-adapter implementation using Diffusers including a training script☆27Aug 3, 2023Updated 2 years ago
- Deep Comprehensible Color Filter Learning Framework for High-Resolution Image Harmonization (ECCV 22)☆52May 17, 2023Updated 2 years ago
- Official implementation of "Perturbed-Attention Guidance"☆60Jul 2, 2024Updated last year
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation☆67Sep 26, 2024Updated last year
- Code for Stable Control Representations☆26Apr 5, 2025Updated 10 months ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆30Apr 27, 2024Updated last year
- ICCV2023-Diffusion-Papers☆108Sep 3, 2023Updated 2 years ago
- a unified reinforcement learning toolbox for joint RL on language models and diffusion models☆75Feb 7, 2026Updated 3 weeks ago
- Code for Ray Conditioning☆30Feb 9, 2024Updated 2 years ago
- [CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"☆607Jun 17, 2025Updated 8 months ago
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆26Dec 12, 2024Updated last year
- ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023☆135Nov 8, 2023Updated 2 years ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆136Dec 21, 2024Updated last year
- A novel zero-shot image harmonization method based on Diffusion Model Prior.☆147Nov 25, 2025Updated 3 months ago
- Implementation of InstructEdit☆76Oct 30, 2023Updated 2 years ago
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization☆76Jun 7, 2024Updated last year
- code for "Natural Language to Code Translation with Execution"☆41Nov 2, 2022Updated 3 years ago
- Code for our EMNLP 2022 paper: Generative Entity Typing with Curriculum Learning.☆13Aug 19, 2023Updated 2 years ago
- A modern audio editor with multitrack capabilities, enhanced waveform visualization, and an intuitive, sleek interface.☆17Aug 12, 2025Updated 6 months ago
- This repository is the official implementation of Topology-Informed Graph Transformer (Choi et al., GRaM Workshop at ICML 2024).☆12Dec 28, 2024Updated last year
- [SIGGRAPH 2025] Official implementation of 'Motion Inversion For Video Customization'☆153Oct 22, 2024Updated last year
- [ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators☆47Sep 11, 2024Updated last year
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆35Dec 5, 2022Updated 3 years ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆42Mar 11, 2025Updated 11 months ago
- Make your Turtlebot2 run on ROS Melodic (Ubuntu 18.04).☆10Jul 2, 2021Updated 4 years ago
- ☆13Oct 4, 2024Updated last year
- [KDD Explore'24]Time Series Forecasting with LLMs: Understanding and Enhancing Model Capabilities☆17May 7, 2025Updated 9 months ago
- A library to query heterogeneous data sources uniformly using SPARQL☆12Dec 5, 2023Updated 2 years ago