RockeyCoss / SPOLinks
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
โ211Updated last month
Alternatives and similar repositories for SPO
Users that are interested in SPO are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] ๐ซCoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matchingโ159Updated 6 months ago
- โ97Updated last month
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingโ102Updated last year
- โ87Updated 8 months ago
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-from uโฆโ198Updated 3 weeks ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"โ99Updated 10 months ago
- Code for FreeScale, a tuning-free method for higher-resolution visual generationโ126Updated 2 months ago
- โ241Updated 10 months ago
- [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimizationโ138Updated 4 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Modelsโ275Updated 5 months ago
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidanceโ272Updated last month
- SigLIP-based Aesthetic Score Predictorโ250Updated 5 months ago
- โ50Updated 5 months ago
- Subjects200K datasetโ111Updated 4 months ago
- ๐ฅ [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)โ176Updated last year
- โ111Updated last year
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generationโ185Updated 3 months ago
- โ116Updated 7 months ago
- [SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modelingโ168Updated 8 months ago
- VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learningโ245Updated last month
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representationsโ141Updated 3 months ago
- Official Implementation: Training-Free Efficient Video Generation via Dynamic Token Carvingโ170Updated this week
- Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR2025)โ207Updated 2 months ago
- The code of our work "Golden Noise for Diffusion Models: A Learning Framework".โ155Updated 3 months ago
- โ151Updated 7 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".โ204Updated last month
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-projectโ158Updated 2 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]โ88Updated 3 months ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. ไธไธชๆฏๆ็จๆท่ช็ฑ่พๅ ฅๆงโฆโ125Updated 10 months ago
- โ229Updated this week