Xiaohui9607 / LLM_layout_generator
LLM as a layout generator, designed to improve the compositional ability of Stable Diffusion models
☆15Updated last year
Alternatives and similar repositories for LLM_layout_generator
Users who are interested in LLM_layout_generator are comparing it to the libraries listed below
- ☆20Updated 7 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 3 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated 3 months ago
- Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".☆18Updated last year
- ☆18Updated 6 months ago
- ☆10Updated last year
- ☆20Updated last year
- Fast Sprite Decomposition from Animated Graphics [ECCV2024]☆32Updated 7 months ago
- ☆26Updated 2 months ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆41Updated last year
- Codebase for the paper HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Updated 11 months ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆16Updated 3 weeks ago
- ☆21Updated last year
- ☆13Updated 2 years ago
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆23Updated 10 months ago
- A curated list of Text-to-Video Generation papers and BibTeX entries☆19Updated last year
- ☆15Updated last month
- [3DV 2025] Learning Naturally Aggregated Appearance for Efficient 3D Editing☆34Updated 3 months ago
- ☆21Updated 4 months ago
- [ECCV 2024] IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation☆54Updated 7 months ago
- EditWorld: Simulating World Dynamics for Instruction-Following Image Editing☆130Updated 10 months ago
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆30Updated 5 months ago
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models☆28Updated 4 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆54Updated 3 months ago
- ☆28Updated last year
- Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation☆48Updated 9 months ago
- This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation☆35Updated last month
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆80Updated last year
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆17Updated 7 months ago
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆23Updated 9 months ago