A linear estimator on top of clip to predict the aesthetic quality of pictures
☆700Aug 15, 2022Updated 3 years ago
Alternatives and similar repositories for aesthetic-predictor
Users that are interested in aesthetic-predictor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CLIP+MLP Aesthetic Score Predictor☆1,298Jul 1, 2024Updated last year
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,672Oct 29, 2025Updated 6 months ago
- ☆593Dec 21, 2024Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆671May 24, 2024Updated 2 years ago
- Description and pointers of laion datasets☆255Nov 5, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Dataset of prompts, synthetic AI generated images, and aesthetic ratings.☆426Jul 29, 2022Updated 3 years ago
- ☆166Sep 5, 2022Updated 3 years ago
- SigLIP-based Aesthetic Score Predictor☆412Dec 18, 2024Updated last year
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.☆4,418Oct 19, 2025Updated 7 months ago
- Home of `erlich` and `ongo`. Finetune latent-diffusion/glid-3-xl text2image on your own data.☆181Aug 5, 2022Updated 3 years ago
- CLIP-based aesthetics predictor inspired by the interface of 🤗 huggingface transformers.☆45Aug 8, 2025Updated 9 months ago
- ☆3,449May 14, 2024Updated 2 years ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,299Oct 31, 2024Updated last year
- Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)☆1,975Updated this week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆295Jul 14, 2023Updated 2 years ago
- ☆3,053Feb 27, 2023Updated 3 years ago
- ☆131Jan 10, 2023Updated 3 years ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆693Nov 10, 2025Updated 6 months ago
- 1.4B latent diffusion model fine tuning☆265May 16, 2022Updated 4 years ago
- Easily compute clip embeddings and build a clip retrieval system with them☆2,764Mar 28, 2026Updated last month
- ☆466May 30, 2023Updated 2 years ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆320Nov 1, 2024Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆770Jan 26, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Majesty Diffusion by @Dango233(@Dango233max) and @apolinario (@multimodalart)☆275Jul 25, 2022Updated 3 years ago
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,632Mar 23, 2026Updated 2 months ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,483May 31, 2023Updated 2 years ago
- T2I-Adapter☆3,803Jun 21, 2024Updated last year
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP☆39Nov 26, 2022Updated 3 years ago
- Consistency Distilled Diff VAE☆2,214Nov 7, 2023Updated 2 years ago
- Efficiently read embedding in streaming from any filesystem☆105Aug 9, 2025Updated 9 months ago
- An open source implementation of CLIP.☆13,835Updated this week
- GenEval: An object-focused framework for evaluating text-to-image alignment☆454Mar 3, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- High-Resolution Image Synthesis with Latent Diffusion Models☆14,040Feb 29, 2024Updated 2 years ago
- ☆402Jul 11, 2024Updated last year
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,930Jan 8, 2026Updated 4 months ago
- [ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspect…☆504Aug 12, 2024Updated last year
- ☆258Dec 27, 2023Updated 2 years ago
- ☆203Jul 12, 2024Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,948Aug 15, 2024Updated last year