CaraJ7/CoMat

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CaraJ7/CoMat)

CaraJ7 / CoMat

[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

☆169

Alternatives and similar repositories for CoMat

Users that are interested in CoMat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RockeyCoss / SPO
View on GitHub
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
☆271Apr 7, 2025Updated last year
TencentQQGYLab / ELLA
View on GitHub
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
☆1,285Jul 17, 2024Updated 2 years ago
mlpc-ucsd / TokenCompose
View on GitHub
(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
☆137Dec 21, 2024Updated last year
ShihaoZhaoZSH / LaVi-Bridge
View on GitHub
[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
☆300Jul 17, 2024Updated 2 years ago
fusiming3 / MARS
View on GitHub
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
☆86Jul 16, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mihirp1998 / AlignProp
View on GitHub
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…
☆324Nov 1, 2024Updated last year
UCSC-VLAA / HQ-Edit
View on GitHub
[ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
☆114Apr 18, 2024Updated 2 years ago
yigu1008 / Diffusion-RPO
View on GitHub
☆15Mar 30, 2025Updated last year
tgxs002 / HPSv2
View on GitHub
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
☆677May 24, 2024Updated 2 years ago
cosmicman-cvpr2024 / CosmicMan
View on GitHub
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
☆348Jul 26, 2024Updated last year
Shentao-YANG / Dense_Reward_T2I
View on GitHub
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
☆39May 9, 2024Updated 2 years ago
CaraJ7 / DraCo
View on GitHub
Offical Repository for Paper: DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
☆17Dec 7, 2025Updated 7 months ago
Karine-Huang / T2I-CompBench
View on GitHub
[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
☆346May 7, 2026Updated 2 months ago
ali-vilab / Ranni
View on GitHub
☆237Apr 10, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
LgQu / DPT-T2I
View on GitHub
Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation
☆33Mar 30, 2025Updated last year
mapo-t2i / mapo
View on GitHub
Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).
☆83Jun 11, 2024Updated 2 years ago
Monalissaa / DisenDiff
View on GitHub
[CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization
☆111Apr 10, 2024Updated 2 years ago
pOpsPaper / pOps
View on GitHub
Official implementation for "pOps: Photo-Inspired Diffusion Operators"
☆86Jul 23, 2024Updated last year
SPRIGHT-T2I / SPRIGHT
View on GitHub
[ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"
☆105Jul 5, 2024Updated 2 years ago
YangLing0818 / RPG-DiffusionMaster
View on GitHub
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
☆1,838Feb 1, 2025Updated last year
CaraJ7 / T2I-R1
View on GitHub
[NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
☆433Sep 18, 2025Updated 10 months ago
SalesforceAIResearch / DiffusionDPO
View on GitHub
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
☆704Jun 2, 2026Updated last month
Xiaojiu-z / SSR_Encoder
View on GitHub
Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)
☆128Jul 22, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
cvlab-kaist / DreamMatcher
View on GitHub
Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (…
☆174Feb 27, 2024Updated 2 years ago
lmbxmu / CutDiffusion
View on GitHub
CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method
☆27Oct 9, 2025Updated 9 months ago
yuvalkirstain / PickScore
View on GitHub
☆600Dec 21, 2024Updated last year
wtybest / EnMMDiT
View on GitHub
[TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
☆15Mar 7, 2026Updated 4 months ago
genforce / freecontrol
View on GitHub
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Con…
☆480Oct 21, 2024Updated last year
poppuppy / SAR
View on GitHub
☆34Dec 29, 2025Updated 6 months ago
MME-Benchmarks / MME-CoT
View on GitHub
MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency
☆136Aug 5, 2025Updated 11 months ago
louisYen / Gen4Gen
View on GitHub
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
☆110Mar 27, 2026Updated 3 months ago
TempleX98 / EasyRef
View on GitHub
[ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM
☆73Jul 16, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yuval-alaluf / Attend-and-Excite
View on GitHub
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
☆771Jan 26, 2024Updated 2 years ago
PRIV-Creation / Awesome-Controllable-T2I-Diffusion-Models
View on GitHub
A collection of resources on controllable generation with text-to-image diffusion models.
☆1,111Dec 31, 2024Updated last year
sled-group / InfEdit
View on GitHub
[CVPR 2024] Official implementation, Inversion-Free Image Editing with Natural Language"
☆362May 28, 2024Updated 2 years ago
eclipse-t2i / lambda-eclipse-inference
View on GitHub
[TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent…
☆53Nov 29, 2024Updated last year
garibida / ReNoise-Inversion
View on GitHub
Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"
☆265Jul 3, 2024Updated 2 years ago
Attention-Refocusing / attention-refocusing
View on GitHub
☆133Jul 17, 2024Updated 2 years ago
xtudbxk / FreCaS
View on GitHub
The public source code of "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling"
☆32Jul 7, 2025Updated last year