[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
☆168Nov 18, 2024Updated last year
Alternatives and similar repositories for CoMat
Users that are interested in CoMat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization☆270Apr 7, 2025Updated last year
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆1,285Jul 17, 2024Updated last year
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆137Dec 21, 2024Updated last year
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆320Nov 1, 2024Updated last year
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation☆299Jul 17, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆672May 24, 2024Updated 2 years ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆113Apr 18, 2024Updated 2 years ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆86Jul 16, 2024Updated last year
- ☆15Mar 30, 2025Updated last year
- CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)☆348Jul 26, 2024Updated last year
- Scalable group inference for generating high quality and diverse images with diffusion models.☆43Aug 31, 2025Updated 9 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆39May 9, 2024Updated 2 years ago
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation☆344May 7, 2026Updated last month
- ☆237Apr 10, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆82Jun 11, 2024Updated last year
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆33Mar 30, 2025Updated last year
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆86Jul 23, 2024Updated last year
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆110Apr 10, 2024Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆105Jul 5, 2024Updated last year
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT☆431Sep 18, 2025Updated 8 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,842Feb 1, 2025Updated last year
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆697Jun 2, 2026Updated last week
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)☆128Jul 22, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (…☆173Feb 27, 2024Updated 2 years ago
- ☆594Dec 21, 2024Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 8 months ago
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency☆136Aug 5, 2025Updated 10 months ago
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation☆14Mar 7, 2026Updated 3 months ago
- ☆34Dec 29, 2025Updated 5 months ago
- Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Con…☆480Oct 21, 2024Updated last year
- A collection of resources on controllable generation with text-to-image diffusion models.☆1,113Dec 31, 2024Updated last year
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆110Mar 27, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆72Jul 16, 2025Updated 10 months ago
- [CVPR 2024] Official implementation, Inversion-Free Image Editing with Natural Language"☆361May 28, 2024Updated 2 years ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆771Jan 26, 2024Updated 2 years ago
- [TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent…☆53Nov 29, 2024Updated last year
- ☆133Jul 17, 2024Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…☆120Mar 29, 2023Updated 3 years ago
- Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"☆265Jul 3, 2024Updated last year