yuli0103 / LayoutDiT
LayoutDiT: Exploring Content-Graphic Balance in Layout Generation with Diffusion Transformer
☆42Updated 3 months ago
Alternatives and similar repositories for LayoutDiT:
Users that are interested in LayoutDiT are comparing it to the libraries listed below
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆47Updated 5 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 9 months ago
- Official Implementation of VideoDPO☆84Updated 3 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆70Updated 3 weeks ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆100Updated last year
- Continuous diffusion for layout generation☆42Updated 2 months ago
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆69Updated 9 months ago
- [ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion☆37Updated 9 months ago
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)☆42Updated last week
- This repo contains the code for PreciseControl project [ECCV'24]☆60Updated 6 months ago
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆103Updated 2 weeks ago
- ☆48Updated 4 months ago
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation☆32Updated 3 weeks ago
- EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing☆21Updated last month
- ☆29Updated 5 months ago
- EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆54Updated 2 weeks ago
- An unofficial implementation of the paper “DiffEdit: Diffusion-based semantic image editing with mask guidance”☆34Updated last year
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"☆42Updated last year
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆73Updated 3 weeks ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆52Updated last month
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models