zhenyuw16 / CompAgent_code
Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".
☆18Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for CompAgent_code
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆44Updated 2 months ago
- Official code base for paper EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guid…☆34Updated last month
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"☆81Updated 3 weeks ago
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)☆76Updated 9 months ago
- Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space"☆44Updated 7 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"☆31Updated last week
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆91Updated 9 months ago
- ☆40Updated 11 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆64Updated last week
- [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion☆86Updated this week
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)☆72Updated this week
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆110Updated last month
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆102Updated 6 months ago
- ☆39Updated 3 months ago
- [ECCV 2024] BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion☆19Updated 4 months ago
- [ECCV2024] PartCraft: Crafting Creative Objects by Parts☆82Updated last month
- ☆77Updated 2 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆78Updated 7 months ago
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆75Updated 7 months ago
- EditWorld: Simulating World Dynamics for Instruction-Following Image Editing☆121Updated 4 months ago
- ☆23Updated last year
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆70Updated 3 months ago
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation☆38Updated last year
- [ICLR 2024] Code for FreeNoise based on AnimateDiff☆106Updated 10 months ago
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024]☆32Updated 11 months ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆37Updated last year
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆34Updated last month
- ☆16Updated 3 months ago
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation☆61Updated 5 months ago
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models☆44Updated 3 months ago