CaraJ7 / CoMatLinks
[NeurIPS 2024] π«CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
β168Updated last year
Alternatives and similar repositories for CoMat
Users that are interested in CoMat are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimizationβ262Updated 10 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Compositionβ176Updated 5 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingβ113Updated last year
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)β141Updated 9 months ago
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-from uβ¦β210Updated 9 months ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)β128Updated last year
- Subjects200K datasetβ129Updated last year
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)β97Updated last year
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representationsβ148Updated 11 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]β141Updated last year
- β53Updated last year
- β50Updated last year
- β112Updated last year
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications".β152Updated 2 years ago
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesisβ86Updated last year
- β93Updated last year
- Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)β266Updated 2 months ago
- β109Updated last year
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"β103Updated last year
- [CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generationβ282Updated last year
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Modelsβ119Updated last year
- β238Updated last year
- Conceptrol: Concept Control of Zero-shot Personalized Image Generationβ45Updated 10 months ago
- [CVPR 2024] Official implementation, Inversion-Free Image Editing with Natural Language"β356Updated last year
- "FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusersβ102Updated 2 years ago
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generationβ44Updated 7 months ago
- β123Updated 5 months ago
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidanceβ307Updated 6 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalizationβ109Updated last year
- Official Pytorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)β231Updated last year