discus0434 / aesthetic-predictor-v2-5Links
SigLIP-based Aesthetic Score Predictor
β280Updated 6 months ago
Alternatives and similar repositories for aesthetic-predictor-v2-5
Users that are interested in aesthetic-predictor-v2-5 are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimizationβ234Updated 3 months ago
- CSGO: Content-Style Composition in Text-to-Image Generation π₯β355Updated 10 months ago
- Implicit Style-Content Separation using B-LoRAβ381Updated 8 months ago
- β176Updated last year
- β112Updated last year
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidanceβ284Updated last week
- Official Repository of the paper "Trajectory Consistency Distillation"β343Updated last year
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".β207Updated 3 months ago
- β155Updated 8 months ago
- β233Updated last year
- IP Adapter Instructβ206Updated 11 months ago
- Consistency Distillation with Target Timestep Selection and Decoupled Guidanceβ83Updated 6 months ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Controlβ187Updated 6 months ago
- [NeurIPS 2024] π«CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matchingβ158Updated 7 months ago
- CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)β341Updated 11 months ago
- [ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseenβ¦β246Updated last month
- β250Updated 11 months ago
- Official code for ICCV 205 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillaβ¦β74Updated 2 weeks ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!β403Updated 4 months ago
- Official implementation of "Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance" (NeurIPS 2024)β291Updated 3 weeks ago
- Subjects200K datasetβ114Updated 5 months ago
- Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR β¦β450Updated 5 months ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesisβ537Updated last year
- Personalize Anything for Free with Diffusion Transformerβ334Updated 3 months ago
- Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"β242Updated last year
- I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Modelsβ205Updated last year
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!β483Updated 7 months ago
- β50Updated 6 months ago
- [TOG 2024]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapterβ242Updated 3 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2β302Updated 5 months ago