[NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
☆120 · Nov 14, 2024 · Updated last year
Alternatives and similar repositories for RealCompo
Users who are interested in RealCompo are comparing it to the libraries listed below.
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation ☆203 · Feb 19, 2025 · Updated last year
- Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition" ☆109 · Nov 24, 2025 · Updated 3 months ago
- PyTorch implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation" (CVPR 2024) ☆128 · Jul 22, 2024 · Updated last year
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization ☆76 · Jun 7, 2024 · Updated last year
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign) ☆80 · Apr 23, 2025 · Updated 10 months ago
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024] ☆24 · Aug 13, 2024 · Updated last year
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models" ☆103 · Jul 5, 2024 · Updated last year
- This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Gener… ☆17 · Jul 1, 2024 · Updated last year
- Extend BoxDiff to SDXL (SDXL-based layout-to-image generation) ☆26 · May 23, 2024 · Updated last year
- Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (… ☆174 · Feb 27, 2024 · Updated 2 years ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,844 · Feb 1, 2025 · Updated last year
- [TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent… ☆53 · Nov 29, 2024 · Updated last year
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision ☆136 · Dec 21, 2024 · Updated last year
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs. ☆19 · Jun 27, 2024 · Updated last year
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ☆45 · Nov 29, 2023 · Updated 2 years ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model ☆48 · Sep 13, 2024 · Updated last year
- This is an official repository for the paper, NoiseCollage, which is a revolutionary extension of text-to-image diffusion models for layo… ☆63 · May 16, 2024 · Updated last year
- InstructG2I: Synthesizing Images from Multimodal Attributed Graphs (NeurIPS 2024) ☆20 · Oct 17, 2024 · Updated last year
- The official implementation for Detector Guidance for Multi-Object Text-to-Image Generation (DG) ☆20 · Feb 7, 2024 · Updated 2 years ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" ☆65 · May 1, 2024 · Updated last year
- ☆67 · Jun 27, 2024 · Updated last year
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications". ☆152 · Dec 28, 2023 · Updated 2 years ago
- ☆11 · Jul 26, 2024 · Updated last year
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025) ☆52 · Jan 14, 2026 · Updated last month
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator ☆98 · Mar 18, 2024 · Updated last year
- [NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation ☆33 · Oct 17, 2025 · Updated 4 months ago
- [ACM Multimedia 2025 Datasets Track] EditWorld: Simulating World Dynamics for Instruction-Following Image Editing ☆139 · Aug 2, 2025 · Updated 7 months ago
- Repository for the Paper "Multi-LoRA Composition for Image Generation" ☆491 · Mar 31, 2024 · Updated last year
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation ☆72 · May 24, 2024 · Updated last year
- ☆238 · Apr 10, 2024 · Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆86 · Jul 16, 2024 · Updated last year
- [NeurIPS 2024] Official implementation of "Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models." ☆55 · Aug 14, 2025 · Updated 6 months ago
- ☆184 · Oct 28, 2024 · Updated last year
- [ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diff… ☆234 · Feb 14, 2025 · Updated last year
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis ☆104 · Jan 18, 2024 · Updated 2 years ago
- ICLR 2024 (Spotlight) ☆785 · Mar 2, 2024 · Updated 2 years ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter ☆427 · Aug 25, 2025 · Updated 6 months ago
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation ☆232 · Nov 4, 2024 · Updated last year
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment ☆1,277 · Jul 17, 2024 · Updated last year