[NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
β121Nov 14, 2024Updated last year
Alternatives and similar repositories for RealCompo
Users that are interested in RealCompo are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generationβ204Feb 19, 2025Updated last year
- ποΈ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"β110Nov 24, 2025Updated 3 months ago
- [TMLR] Official PyTorch implementation of "Ξ»-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latentβ¦β53Nov 29, 2024Updated last year
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimizationβ77Jun 7, 2024Updated last year
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)β80Apr 23, 2025Updated 10 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)β1,843Feb 1, 2025Updated last year
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"β65May 1, 2024Updated last year
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)β128Jul 22, 2024Updated last year
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]β24Aug 13, 2024Updated last year
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"β103Jul 5, 2024Updated last year
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animatorβ98Mar 18, 2024Updated 2 years ago
- (CVPR 2024) π§© TokenCompose: Text-to-Image Diffusion with Token-level Supervisionβ136Dec 21, 2024Updated last year
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.β19Jun 27, 2024Updated last year
- Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)β27May 23, 2024Updated last year
- β238Apr 10, 2024Updated last year
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesisβ104Jan 18, 2024Updated 2 years ago
- Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (β¦β174Feb 27, 2024Updated 2 years ago
- This is an official repository for the paper, NoiseCollage, which is a revolutionary extension of text-to-image diffusion models for layoβ¦β63May 16, 2024Updated last year
- β11Jul 26, 2024Updated last year
- NeurIPS'2022: Pluralistic Image Completion with Gaussian Mixture Modelsβ14Jan 28, 2023Updated 3 years ago
- [ACM Multimedia 2025 Datasets Track] EditWorld: Simulating World Dynamics for Instruction-Following Image Editingβ140Aug 2, 2025Updated 7 months ago
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generationβ240Nov 4, 2024Updated last year
- β66Jun 27, 2024Updated last year
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)β45Nov 29, 2023Updated 2 years ago
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generationβ73May 24, 2024Updated last year
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignmentβ1,281Jul 17, 2024Updated last year
- InstructG2I: Synthesizing Images from Multimodal Attributed Graphs (NeurIPs 2024)β20Oct 17, 2024Updated last year
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications".β152Dec 28, 2023Updated 2 years ago
- [CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Promptsβ311Jun 9, 2024Updated last year
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Modelβ48Sep 13, 2024Updated last year
- Repository for the Paper "Multi-LoRA Composition for Image Generation"β492Mar 31, 2024Updated last year
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"β47Jul 4, 2024Updated last year
- [TOG 2024]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapterβ267Apr 5, 2025Updated 11 months ago
- Official implementation of the paper "MotionCrafter: One-Shot Motion Customization of Diffusion Models"β29Jan 4, 2024Updated 2 years ago
- ICLR 2024 (Spotlight)β786Mar 2, 2024Updated 2 years ago
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"β32Nov 30, 2025Updated 3 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generationβ81Apr 10, 2024Updated last year
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation [TMLR 2024]β260Jul 1, 2024Updated last year
- β94Apr 21, 2025Updated 11 months ago