Xiaohui9607 / LLM_layout_generator
LLM as a layout generator, designed to improve the compositional ability of Stable Diffusion models
☆17Updated last year
Alternatives and similar repositories for LLM_layout_generator
Users that are interested in LLM_layout_generator are comparing it to the libraries listed below
- (ICCV'25) TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning in Text-to-Image Models (Au…☆10Updated 3 months ago
- ☆20Updated 10 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated 5 months ago
- ☆13Updated 2 years ago
- Fast Sprite Decomposition from Animated Graphics [ECCV2024]☆32Updated 9 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 5 months ago
- DiT for VAE (and Video Generation)☆34Updated 10 months ago
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models☆29Updated 2 months ago
- Codebase for the paper HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Updated last year
- ☆22Updated 2 months ago
- ☆11Updated last year
- Code for full fine-tuning of the Mochi model with FSDP (and CP)☆28Updated 3 months ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆48Updated 10 months ago
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆23Updated 11 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆57Updated 5 months ago
- Official Implementation of GrounDiT (NeurIPS 2024)☆54Updated 7 months ago
- [ECCV 2024] IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation☆55Updated 10 months ago
- ☆4Updated 9 months ago
- Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".☆18Updated last year
- ☆19Updated 3 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆128Updated last year
- ☆26Updated 4 months ago
- ☆70Updated 9 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆69Updated 7 months ago
- Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"☆40Updated last week
- Blending Custom Photos with Video Diffusion Transformers☆47Updated 5 months ago
- Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing☆52Updated last year
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024]☆33Updated last year
- [ICCV'25] FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model☆51Updated 2 weeks ago
- ☆23Updated 8 months ago