CLIP+MLP Aesthetic Score Predictor
☆1,265Jul 1, 2024Updated last year
Alternatives and similar repositories for improved-aesthetic-predictor
Users that are interested in improved-aesthetic-predictor are comparing it to the libraries listed below
Sorting:
- A linear estimator on top of clip to predict the aesthetic quality of pictures☆674Aug 15, 2022Updated 3 years ago
- ☆580Dec 21, 2024Updated last year
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,641Oct 29, 2025Updated 4 months ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆647May 24, 2024Updated last year
- SigLIP-based Aesthetic Score Predictor☆386Dec 18, 2024Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,281Oct 31, 2024Updated last year
- T2I-Adapter☆3,799Jun 21, 2024Updated last year
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆667Nov 10, 2025Updated 3 months ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆294Jul 14, 2023Updated 2 years ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆314Nov 1, 2024Updated last year
- Dataset of prompts, synthetic AI generated images, and aesthetic ratings.☆425Jul 29, 2022Updated 3 years ago
- ☆3,052Feb 27, 2023Updated 3 years ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,480Jun 28, 2024Updated last year
- SD Aesthetic Gens Algo☆101Nov 2, 2022Updated 3 years ago
- ☆3,441May 14, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,253Feb 16, 2025Updated last year
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.☆4,371Oct 19, 2025Updated 4 months ago
- Using Low-rank adaptation to quickly fine-tune diffusion models.☆7,528Mar 22, 2024Updated last year
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆8,393May 31, 2024Updated last year
- ☆397Jul 11, 2024Updated last year
- ☆6,927Feb 25, 2026Updated last week
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆32,923Updated this week
- ☆167Sep 5, 2022Updated 3 years ago
- Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)☆1,971Dec 1, 2025Updated 3 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,177Nov 18, 2024Updated last year
- Karras et al. (2022) diffusion models for PyTorch☆2,569Feb 12, 2026Updated 3 weeks ago
- An open source implementation of CLIP.☆13,460Feb 27, 2026Updated last week
- ☆145Jan 16, 2023Updated 3 years ago
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,513Updated this week
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆764Jan 26, 2024Updated 2 years ago
- High-Resolution Image Synthesis with Latent Diffusion Models☆13,864Feb 29, 2024Updated 2 years ago
- [ICCV 2023] Consistent Image Synthesis and Editing☆840Aug 19, 2024Updated last year
- Description and pointers of laion datasets☆251Nov 5, 2022Updated 3 years ago
- ☆130Jan 10, 2023Updated 3 years ago
- Consistency Distilled Diff VAE☆2,211Nov 7, 2023Updated 2 years ago
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,897Oct 31, 2024Updated last year
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,293Nov 27, 2025Updated 3 months ago
- ☆2,230Nov 8, 2024Updated last year
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,615Jun 14, 2024Updated last year