RockeyCoss / SPO
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
โ176Updated last month
Alternatives and similar repositories for SPO:
Users that are interested in SPO are comparing it to the libraries listed below
- [NeurIPS 2024] ๐ซCoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matchingโ144Updated 2 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"โ99Updated 7 months ago
- โ81Updated 4 months ago
- โ110Updated 4 months ago
- Code for FreeScale, a tuning-free method for higher-resolution visual generationโ114Updated last month
- โ47Updated last month
- GenEval: An object-focused framework for evaluating text-to-image alignmentโ170Updated 6 months ago
- โ109Updated 11 months ago
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications".โ133Updated last year
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. ไธไธชๆฏๆ็จๆท่ช็ฑ่พๅ ฅๆงโฆโ121Updated 7 months ago
- Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"โ215Updated 7 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".โ194Updated this week
- Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models"โ186Updated 3 months ago
- [SIGGRAPH Asia 2024 (Journal Track)]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapterโ215Updated 7 months ago
- The implementation of the paper "Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention" (NeurIPS`โฆโ115Updated 4 months ago
- ๐ฅ [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)โ166Updated 10 months ago
- The code of our work "Golden Noise for Diffusion Models: A Learning Framework".โ104Updated last week
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformersโ49Updated 4 months ago
- โ206Updated 6 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]โ78Updated 2 weeks ago
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)โ81Updated 3 weeks ago
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"โ105Updated 3 months ago
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)โ118Updated 3 months ago
- โ38Updated last month
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingโ86Updated 9 months ago
- SigLIP-based Aesthetic Score Predictorโ185Updated last month
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformersโ103Updated last month
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representationsโ135Updated 7 months ago
- โ132Updated 3 months ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".โ117Updated 3 weeks ago