yuli0103 / LayoutDiT
LayoutDiT: Exploring Content-Graphic Balance in Layout Generation with Diffusion Transformer
☆49 · Updated last month
Alternatives and similar repositories for LayoutDiT
Users who are interested in LayoutDiT are comparing it to the repositories listed below.
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback · ☆59 · Updated last year
- [NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark · ☆275 · Updated 3 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis · ☆86 · Updated last year
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment. · ☆85 · Updated 9 months ago
- [ICLR'26] Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play? · ☆48 · Updated last week
- ☆27 · Updated 9 months ago
- Unified layout planning and image generation, ICCV2025 · ☆40 · Updated 3 weeks ago
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation · ☆104 · Updated 3 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] · ☆110 · Updated 2 months ago
- Official repository for "PosterO: Structuring Layout Trees to Enable Language Models in Generalized Content-Aware Layout Generation" (CVP… · ☆19 · Updated 2 months ago
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing" · ☆160 · Updated last year
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT · ☆136 · Updated last year
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google · ☆63 · Updated last year
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation · ☆33 · Updated 10 months ago
- EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling · ☆209 · Updated this week
- [WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing · ☆17 · Updated 10 months ago
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models · ☆74 · Updated last year
- [CVPR'2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization · ☆109 · Updated last year
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing · ☆113 · Updated last year
- Official Implementation of VideoDPO · ☆160 · Updated 8 months ago
- Unified Multi-modal IAA Baseline and Benchmark · ☆92 · Updated last year
- ☆53 · Updated last year
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models" · ☆46 · Updated 2 years ago
- [ICLR 2025] Official code implementation of DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation · ☆130 · Updated 11 months ago
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following · ☆31 · Updated last year
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition · ☆176 · Updated 5 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation · ☆86 · Updated last year
- ☆119 · Updated last year
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities… · ☆120 · Updated 5 months ago
- Code and Data for "GenAI Arena: An Open Evaluation Platform for Generative Models" [NeurIPS 2024] · ☆33 · Updated last year