[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
☆169 · Nov 18, 2024 · Updated last year
Alternatives and similar repositories for CoMat
Users who are interested in CoMat are comparing it to the repositories listed below
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization ☆265 · Apr 7, 2025 · Updated 11 months ago
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment ☆1,281 · Jul 17, 2024 · Updated last year
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision ☆136 · Dec 21, 2024 · Updated last year
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp… ☆314 · Nov 1, 2024 · Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis ☆649 · May 24, 2024 · Updated last year
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation ☆299 · Jul 17, 2024 · Updated last year
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing ☆113 · Apr 18, 2024 · Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆86 · Jul 16, 2024 · Updated last year
- (no description) ☆15 · Mar 30, 2025 · Updated 11 months ago
- CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024) ☆349 · Jul 26, 2024 · Updated last year
- Scalable group inference for generating high quality and diverse images with diffusion models. ☆42 · Aug 31, 2025 · Updated 6 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24). ☆40 · May 9, 2024 · Updated last year
- [NeurIPS 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation ☆334 · Dec 24, 2025 · Updated 2 months ago
- (no description) ☆238 · Apr 10, 2024 · Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO). ☆82 · Jun 11, 2024 · Updated last year
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation ☆33 · Mar 30, 2025 · Updated 11 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators" ☆85 · Jul 23, 2024 · Updated last year
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization ☆109 · Apr 10, 2024 · Updated last year
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models" ☆103 · Jul 5, 2024 · Updated last year
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT ☆432 · Sep 18, 2025 · Updated 6 months ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization" ☆671 · Nov 10, 2025 · Updated 4 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,843 · Feb 1, 2025 · Updated last year
- PyTorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation" (CVPR 2024) ☆128 · Jul 22, 2024 · Updated last year
- Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (… ☆174 · Feb 27, 2024 · Updated 2 years ago
- (no description) ☆582 · Dec 21, 2024 · Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆27 · Oct 9, 2025 · Updated 5 months ago
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency ☆136 · Aug 5, 2025 · Updated 7 months ago
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation ☆11 · Mar 7, 2026 · Updated 2 weeks ago
- (no description) ☆34 · Dec 29, 2025 · Updated 2 months ago
- Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Con… ☆477 · Oct 21, 2024 · Updated last year
- A collection of resources on controllable generation with text-to-image diffusion models. ☆1,113 · Dec 31, 2024 · Updated last year
- Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition" ☆110 · Nov 24, 2025 · Updated 3 months ago
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ☆71 · Jul 16, 2025 · Updated 8 months ago
- [CVPR 2024] Official implementation of "Inversion-Free Image Editing with Natural Language" ☆358 · May 28, 2024 · Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023) ☆767 · Jan 26, 2024 · Updated 2 years ago
- [TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent… ☆53 · Nov 29, 2024 · Updated last year
- (no description) ☆133 · Jul 17, 2024 · Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text… ☆120 · Mar 29, 2023 · Updated 2 years ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign) ☆80 · Apr 23, 2025 · Updated 10 months ago