CaraJ7 / CoMatLinks
[NeurIPS 2024] π«CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
β166Updated 11 months ago
Alternatives and similar repositories for CoMat
Users that are interested in CoMat are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimizationβ255Updated 7 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Compositionβ169Updated 2 months ago
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-from uβ¦β209Updated 6 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingβ111Updated last year
- Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)β215Updated 2 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]β132Updated 9 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representationsβ147Updated 8 months ago
- β105Updated last year
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)β136Updated 6 months ago
- Subjects200K datasetβ123Updated 9 months ago
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)β97Updated 9 months ago
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesisβ76Updated 9 months ago
- β51Updated 10 months ago
- β91Updated last year
- β50Updated 10 months ago
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Modelsβ116Updated last year
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)β125Updated last year
- Conceptrol: Concept Control of Zero-shot Personalized Image Generationβ44Updated 7 months ago
- β112Updated last year
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"β103Updated last year
- β121Updated 2 months ago
- "FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusersβ100Updated 2 years ago
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications".β145Updated last year
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalizationβ105Updated last year
- [NeurIPS 2025 D&Bπ₯] ImgEdit: A Unified Image Editing Dataset and Benchmarkβ224Updated last week
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidanceβ299Updated 3 months ago
- Official Pytorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)β222Updated 9 months ago
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)β94Updated 8 months ago
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformerβ115Updated 4 months ago
- [ICLR 2025] Official code implementation of DreamBench++: A Human-Aligned Benchmark for Personalized Image Generationβ125Updated 8 months ago