[NeurIPS 2024] π«CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
β168Nov 18, 2024Updated last year
Alternatives and similar repositories for CoMat
Users that are interested in CoMat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimizationβ271Apr 7, 2025Updated last year
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignmentβ1,284Jul 17, 2024Updated last year
- (CVPR 2024) π§© TokenCompose: Text-to-Image Diffusion with Token-level Supervisionβ137Dec 21, 2024Updated last year
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sampβ¦β321Nov 1, 2024Updated last year
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generationβ299Jul 17, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesisβ676May 24, 2024Updated 2 years ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingβ113Apr 18, 2024Updated 2 years ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesisβ86Jul 16, 2024Updated last year
- β15Mar 30, 2025Updated last year
- CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)β348Jul 26, 2024Updated last year
- Scalable group inference for generating high quality and diverse images with diffusion models.β43Aug 31, 2025Updated 10 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).β39May 9, 2024Updated 2 years ago
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluationβ344May 7, 2026Updated last month
- β237Apr 10, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).β82Jun 11, 2024Updated 2 years ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generationβ33Mar 30, 2025Updated last year
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"β86Jul 23, 2024Updated last year
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalizationβ110Apr 10, 2024Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"β105Jul 5, 2024Updated last year
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoTβ432Sep 18, 2025Updated 9 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)β1,840Feb 1, 2025Updated last year
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"β701Jun 2, 2026Updated 3 weeks ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)β128Jul 22, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (β¦β173Feb 27, 2024Updated 2 years ago
- β598Dec 21, 2024Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Methodβ27Oct 9, 2025Updated 8 months ago
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiencyβ135Aug 5, 2025Updated 10 months ago
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generationβ15Mar 7, 2026Updated 3 months ago
- β34Dec 29, 2025Updated 6 months ago
- Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Conβ¦β480Oct 21, 2024Updated last year
- A collection of resources on controllable generation with text-to-image diffusion models.β1,113Dec 31, 2024Updated last year
- ποΈ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"β110Mar 27, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLMβ72Jul 16, 2025Updated 11 months ago
- [CVPR 2024] Official implementation, Inversion-Free Image Editing with Natural Language"β361May 28, 2024Updated 2 years ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)β771Jan 26, 2024Updated 2 years ago
- [TMLR] Official PyTorch implementation of "Ξ»-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latentβ¦β53Nov 29, 2024Updated last year
- β133Jul 17, 2024Updated last year
- π€ Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Textβ¦β120Mar 29, 2023Updated 3 years ago
- Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"β265Jul 3, 2024Updated last year