RockeyCoss / SPO
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
☆206, updated last month
Alternatives and similar repositories for SPO:
Users who are interested in SPO are comparing it to the repositories listed below:
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching (☆158, updated 5 months ago)
- (☆94, updated last month)
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-form u… (☆194, updated this week)
- Code for FreeScale, a tuning-free method for higher-resolution visual generation (☆126, updated 2 months ago)
- (☆86, updated 7 months ago)
- (☆50, updated 4 months ago)
- (☆110, updated last year)
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation (☆184, updated 2 months ago)
- Subjects200K dataset (☆110, updated 3 months ago)
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024) (☆87, updated 3 months ago)
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing (☆100, updated last year)
- (☆114, updated 6 months ago)
- (☆229, updated 9 months ago)
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". (☆203, updated last month)
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Models (☆121, updated 2 months ago)
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance (☆266, updated 3 weeks ago)
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models (☆274, updated 5 months ago)
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models" (☆99, updated 10 months ago)
- Code repository for T2V-Turbo and T2V-Turbo-v2 (☆299, updated 3 months ago)
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations (☆140, updated 2 months ago)
- GenEval: An object-focused framework for evaluating text-to-image alignment (☆255, updated 2 months ago)
- PyTorch implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation" (CVPR 2024) (☆114, updated 9 months ago)
- (☆34, updated 3 months ago)
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] (☆88, updated 2 months ago)
- 🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)" (☆175, updated last year)
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025] (☆109, updated 3 months ago)
- [SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling (☆163, updated 7 months ago)
- SigLIP-based Aesthetic Score Predictor (☆234, updated 4 months ago)
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. A model that supports free user input of contr… (☆123, updated 10 months ago)
- The code of our work "Golden Noise for Diffusion Models: A Learning Framework". (☆154, updated 2 months ago)