CaraJ7 / CoMat
[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
☆156 Updated 4 months ago
Alternatives and similar repositories for CoMat:
Users who are interested in CoMat are comparing it to the repositories listed below.
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization ☆198 Updated last week
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆140 Updated last month
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition ☆147 Updated 2 months ago
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-form u… ☆186 Updated 2 weeks ago
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024) ☆123 Updated this week
- Subjects200K dataset ☆107 Updated 3 months ago
- ☆114 Updated 6 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆83 Updated 9 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing ☆95 Updated 11 months ago
- ☆85 Updated 6 months ago
- PyTorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation" (CVPR 2024) ☆113 Updated 8 months ago
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Models ☆118 Updated last month
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024) ☆85 Updated 3 months ago
- Code for FreeScale, a tuning-free method for higher-resolution visual generation ☆121 Updated last month
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models ☆116 Updated 5 months ago
- ☆91 Updated 9 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025] ☆98 Updated 2 months ago
- ☆47 Updated 3 months ago
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations" ☆70 Updated this week
- ☆48 Updated 3 months ago
- [CVPR2024] Official code for Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation ☆86 Updated last year
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models" ☆99 Updated 9 months ago
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications". ☆139 Updated last year
- [CVPR 2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization ☆99 Updated last year
- ☆110 Updated last year
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers ☆113 Updated 3 months ago
- Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024) ☆140 Updated 10 months ago
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation ☆31 Updated 3 weeks ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. A model that lets users freely supply control… ☆123 Updated 9 months ago
- ☆85 Updated last month