Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".
☆18 · Jan 30, 2024 · Updated 2 years ago
Alternatives and similar repositories for CompAgent_code
Users that are interested in CompAgent_code are comparing it to the libraries listed below
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps ☆25 · Jan 22, 2026 · Updated last month
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ☆57 · Jul 25, 2023 · Updated 2 years ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model ☆48 · Sep 13, 2024 · Updated last year
- Code for the ACL2023 paper: CAT: A Contextualized Conceptualization and Instantiation Framework for Commonsense Reasoning (https://aclant… ☆11 · May 9, 2023 · Updated 2 years ago
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing." ☆26 · Jan 21, 2026 · Updated 2 months ago
- Official PyTorch implementation of paper MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling ☆13 · Oct 5, 2024 · Updated last year
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning ☆51 · Jun 17, 2025 · Updated 9 months ago
- ☆32 · Jan 25, 2024 · Updated 2 years ago
- ☆16 · May 23, 2023 · Updated 2 years ago
- OpenThinkIMG is an end-to-end open-source framework that empowers Large Vision-Language Models to think with images. ☆117 · Jul 11, 2025 · Updated 8 months ago
- ☆12 · Oct 7, 2024 · Updated last year
- Code and data for "Does Spatial Cognition Emerge in Frontier Models?" ☆27 · Apr 18, 2025 · Updated 11 months ago
- ☆34 · Dec 29, 2025 · Updated 2 months ago
- InstructG2I: Synthesizing Images from Multimodal Attributed Graphs (NeurIPS 2024) ☆20 · Oct 17, 2024 · Updated last year
- This repository is the code of the paper "DiffusionInst: Diffusion Model for Instance Segmentation". ☆28 · Jan 3, 2023 · Updated 3 years ago
- ☆18 · Dec 18, 2023 · Updated 2 years ago
- ☆10 · Nov 21, 2023 · Updated 2 years ago
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization ☆77 · Jun 7, 2024 · Updated last year
- Examples of flaky Rails specs along with solutions ☆25 · Jun 1, 2022 · Updated 3 years ago
- [TMM 2025] StableIdentity: Inserting Anybody into Anywhere at First Sight 🔥 ☆260 · Dec 26, 2024 · Updated last year
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models. ☆48 · Feb 10, 2026 · Updated last month
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis ☆104 · Jan 18, 2024 · Updated 2 years ago
- ☆13 · May 22, 2024 · Updated last year
- Memory-optimized training scripts for video models based on Diffusers ☆14 · Jan 3, 2025 · Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance". ☆42 · May 24, 2023 · Updated 2 years ago
- ☆39 · Oct 19, 2024 · Updated last year
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition" ☆110 · Nov 24, 2025 · Updated 3 months ago
- Various TeX templates, including slides and posters. ☆16 · May 19, 2022 · Updated 3 years ago
- [CVPR2020] Tensorflow implementation for paper ''Distribution-induced Bidirectional Generative Adversarial Network for Graph Representati… ☆30 · Nov 24, 2021 · Updated 4 years ago
- Code for our paper `Resistance Training using Prior Bias: toward Unbiased Scene Graph Generation` ☆20 · Feb 18, 2024 · Updated 2 years ago
- Shows all your connected webcam feeds right in the browser, in draggable and resizable boxes ☆13 · Jan 27, 2019 · Updated 7 years ago
- ☆23 · Oct 20, 2023 · Updated 2 years ago
- Large Visual Language Model(LVLM), Large Language Model(LLM), Multimodal Large Language Model(MLLM), Alignment, Agent, AI System, Survey ☆21 · Jul 27, 2025 · Updated 7 months ago
- [ICCV 2025] Pytorch implementation of "PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single Reference… ☆52 · Mar 16, 2025 · Updated last year
- ☆11 · Oct 6, 2022 · Updated 3 years ago
- Example of using multiple GPUs with PyTorch DataParallel ☆12 · Jan 28, 2020 · Updated 6 years ago
- [AAAI 2025] CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities ☆52 · Jan 12, 2025 · Updated last year
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Models ☆134 · Feb 27, 2025 · Updated last year
- [ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization ☆20 · May 24, 2025 · Updated 9 months ago