CaraJ7 / CoMat
[NeurIPS 2024] ๐ซCoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
โ149Updated 4 months ago
Alternatives and similar repositories for CoMat:
Users that are interested in CoMat are comparing it to the libraries listed below
- Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimizationโ186Updated 3 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representationsโ137Updated last month
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingโ92Updated 11 months ago
- โ44Updated 3 months ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformersโ110Updated 2 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"โ100Updated 8 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalizationโ97Updated 11 months ago
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)โ84Updated 2 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesisโ83Updated 8 months ago
- โ82Updated 6 months ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. ไธไธชๆฏๆ็จๆท่ช็ฑ่พๅ ฅๆงโฆโ123Updated 8 months ago
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)โ119Updated 4 months ago
- Subjects200K datasetโ103Updated 2 months ago
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Modelsโ116Updated 3 weeks ago
- โ49Updated 2 months ago
- "FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusersโ98Updated last year
- โ113Updated 5 months ago
- [ICLR2025]โ138Updated last month
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesisโ63Updated last month
- Code for FreeScale, a tuning-free method for higher-resolution visual generationโ118Updated 2 weeks ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Compositionโ142Updated last month
- Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"โ63Updated 2 weeks ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)โ111Updated 8 months ago
- Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".โ55Updated 6 months ago
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluationโ243Updated this week
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformersโ50Updated 5 months ago
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Modelsโ114Updated 4 months ago
- โ109Updated last year
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'โ102Updated 3 months ago