[NeurIPS 2024] π«CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
β169Nov 18, 2024Updated last year
Alternatives and similar repositories for CoMat
Users that are interested in CoMat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimizationβ266Apr 7, 2025Updated last year
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignmentβ1,281Jul 17, 2024Updated last year
- (CVPR 2024) π§© TokenCompose: Text-to-Image Diffusion with Token-level Supervisionβ137Dec 21, 2024Updated last year
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sampβ¦β314Nov 1, 2024Updated last year
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generationβ299Jul 17, 2024Updated last year
- NordVPN Special Discount Offer β’ AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesisβ657May 24, 2024Updated last year
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingβ113Apr 18, 2024Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesisβ86Jul 16, 2024Updated last year
- β15Mar 30, 2025Updated last year
- CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)β349Jul 26, 2024Updated last year
- Scalable group inference for generating high quality and diverse images with diffusion models.β42Aug 31, 2025Updated 7 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).β40May 9, 2024Updated last year
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluationβ336Dec 24, 2025Updated 3 months ago
- β237Apr 10, 2024Updated 2 years ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).β82Jun 11, 2024Updated last year
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generationβ33Mar 30, 2025Updated last year
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"β85Jul 23, 2024Updated last year
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalizationβ110Apr 10, 2024Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"β104Jul 5, 2024Updated last year
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoTβ432Sep 18, 2025Updated 6 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)β1,843Feb 1, 2025Updated last year
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"β677Nov 10, 2025Updated 5 months ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)β128Jul 22, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (β¦β174Feb 27, 2024Updated 2 years ago
- β584Dec 21, 2024Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Methodβ27Oct 9, 2025Updated 6 months ago
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiencyβ138Aug 5, 2025Updated 8 months ago
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generationβ13Mar 7, 2026Updated last month
- β34Dec 29, 2025Updated 3 months ago
- Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Conβ¦β478Oct 21, 2024Updated last year
- A collection of resources on controllable generation with text-to-image diffusion models.β1,114Dec 31, 2024Updated last year
- ποΈ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"β110Mar 27, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLMβ72Jul 16, 2025Updated 8 months ago
- [CVPR 2024] Official implementation, Inversion-Free Image Editing with Natural Language"β358May 28, 2024Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)β768Jan 26, 2024Updated 2 years ago
- [TMLR] Official PyTorch implementation of "Ξ»-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latentβ¦β53Nov 29, 2024Updated last year
- β133Jul 17, 2024Updated last year
- π€ Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Textβ¦β120Mar 29, 2023Updated 3 years ago
- Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"β263Jul 3, 2024Updated last year