[NeurIPS 2024] π«CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
β168Nov 18, 2024Updated last year
Alternatives and similar repositories for CoMat
Users that are interested in CoMat are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimizationβ265Apr 7, 2025Updated 10 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingβ113Apr 18, 2024Updated last year
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sampβ¦β312Nov 1, 2024Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesisβ86Jul 16, 2024Updated last year
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignmentβ1,277Jul 17, 2024Updated last year
- (CVPR 2024) π§© TokenCompose: Text-to-Image Diffusion with Token-level Supervisionβ136Dec 21, 2024Updated last year
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).β40May 9, 2024Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).β82Jun 11, 2024Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesisβ646May 24, 2024Updated last year
- β15Mar 30, 2025Updated 11 months ago
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generationβ298Jul 17, 2024Updated last year
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"β85Jul 23, 2024Updated last year
- CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)β349Jul 26, 2024Updated last year
- β238Apr 10, 2024Updated last year
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluationβ331Dec 24, 2025Updated 2 months ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"β666Nov 10, 2025Updated 3 months ago
- β34Dec 29, 2025Updated 2 months ago
- β11Nov 30, 2025Updated 3 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformersβ65Oct 16, 2024Updated last year
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLMβ72Jul 16, 2025Updated 7 months ago
- ποΈ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"β109Nov 24, 2025Updated 3 months ago
- Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (β¦β174Feb 27, 2024Updated 2 years ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generationβ33Mar 30, 2025Updated 11 months ago
- Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"β262Jul 3, 2024Updated last year
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)β128Jul 22, 2024Updated last year
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]β141Jan 27, 2025Updated last year
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"β103Jul 5, 2024Updated last year
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalizationβ109Apr 10, 2024Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Methodβ27Oct 9, 2025Updated 4 months ago
- A collection of resources on controllable generation with text-to-image diffusion models.β1,112Dec 31, 2024Updated last year
- β578Dec 21, 2024Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)β1,844Feb 1, 2025Updated last year
- Scalable group inference for generating high quality and diverse images with diffusion models.β42Aug 31, 2025Updated 6 months ago
- β184Oct 28, 2024Updated last year
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScopeβ¦β309Mar 12, 2025Updated 11 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with tβ¦β153Jun 25, 2024Updated last year
- The official repo of continuous speculative decodingβ31Mar 28, 2025Updated 11 months ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafterβ427Aug 25, 2025Updated 6 months ago
- [CVPR 2024] Official implementation of "DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations"β281Jul 5, 2025Updated 7 months ago