CaraJ7 / CoMat
[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
☆156Updated 5 months ago
Alternatives and similar repositories for CoMat:
Users that are interested in CoMat are comparing it to the libraries listed below
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization☆206Updated last month
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-from u…☆194Updated this week
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆116Updated last month
- ☆94Updated last month
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)☆114Updated 9 months ago
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis☆66Updated 3 months ago
- ☆95Updated 9 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]☆107Updated 3 months ago
- experimental implementation of Consistory☆19Updated 9 months ago
- ☆110Updated last year
- ☆50Updated 4 months ago
- ☆86Updated 7 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆99Updated last year
- Subjects200K dataset☆110Updated 3 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 9 months ago
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation☆37Updated last month
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆140Updated 2 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆99Updated 10 months ago
- ☆91Updated 2 weeks ago
- ☆48Updated 4 months ago
- [Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing☆88Updated last month
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)☆125Updated 3 weeks ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆158Updated last year
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆150Updated 3 months ago
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer☆79Updated last month
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers☆116Updated 3 months ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. 一个支持用户自由输入控…☆123Updated 10 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆101Updated last year
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models☆116Updated 5 months ago
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)☆87Updated 3 months ago