LAION-AI / aesthetic-predictor
A linear estimator on top of clip to predict the aesthetic quality of pictures
☆544Updated 2 years ago
Alternatives and similar repositories for aesthetic-predictor:
Users that are interested in aesthetic-predictor are comparing it to the libraries listed below
- ☆501Updated 4 months ago
- CLIP+MLP Aesthetic Score Predictor☆1,071Updated 10 months ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆498Updated 11 months ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆282Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆734Updated last year
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis☆315Updated last year
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)☆534Updated last year
- Erasing Concepts from Diffusion Models☆602Updated 3 weeks ago
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images☆503Updated 5 months ago
- Diffusion attentive attribution maps for interpreting Stable Diffusion.☆752Updated last year
- Description and pointers of laion datasets☆246Updated 2 years ago
- Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]☆598Updated 11 months ago
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"☆404Updated last year
- Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]☆574Updated 11 months ago
- Implementation of "SVDiff: Compact Parameter Space for Diffusion Fine-Tuning"☆379Updated last year
- Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models☆325Updated 2 years ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆282Updated 6 months ago
- ☆312Updated 3 months ago
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆478Updated 5 months ago
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆220Updated 11 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆420Updated last year
- [NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models☆644Updated 9 months ago
- Large-scale text-video dataset. 10 million captioned short videos.☆631Updated 8 months ago
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆470Updated 7 months ago
- [ICCV 2023] Consistent Image Synthesis and Editing☆792Updated 8 months ago
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models☆354Updated last year
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.☆427Updated 11 months ago
- ☆465Updated 7 months ago
- GenEval: An object-focused framework for evaluating text-to-image alignment☆254Updated 2 months ago
- An implementation of the Prompt-to-Prompt paper for the SDXL architecture☆110Updated 10 months ago