CaraJ7 / CoMat
[NeurIPS 2024] ๐ซCoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
โ145Updated 3 months ago
Alternatives and similar repositories for CoMat:
Users that are interested in CoMat are comparing it to the libraries listed below
- Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimizationโ178Updated 2 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingโ86Updated 10 months ago
- โ39Updated 2 months ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)โ105Updated 6 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Compositionโ137Updated 3 weeks ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformersโ102Updated last month
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)โ118Updated 3 months ago
- โ109Updated 11 months ago
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Modelsโ113Updated 8 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"โ99Updated 7 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representationsโ136Updated this week
- โ81Updated 4 months ago
- [ICLR2025]โ137Updated 3 weeks ago
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications".โ133Updated last year
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]โ79Updated 3 weeks ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. ไธไธชๆฏๆ็จๆท่ช็ฑ่พๅ ฅๆงโฆโ121Updated 7 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesisโ83Updated 7 months ago
- โ110Updated 4 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalizationโ93Updated 10 months ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion โฆโ157Updated 10 months ago
- โ47Updated last month
- Code for FreeScale, a tuning-free method for higher-resolution visual generationโ114Updated last month
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".โ117Updated last month
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesisโ58Updated 2 weeks ago
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)โ82Updated last month
- Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"โ55Updated 2 weeks ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformersโ49Updated 4 months ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'โ102Updated 2 months ago
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Modelsโ112Updated 3 months ago
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluationโ234Updated 2 weeks ago