LAION-AI / aesthetic-predictorLinks
A linear estimator on top of clip to predict the aesthetic quality of pictures
☆646Updated 3 years ago
Alternatives and similar repositories for aesthetic-predictor
Users that are interested in aesthetic-predictor are comparing it to the libraries listed below
Sorting:
- CLIP+MLP Aesthetic Score Predictor☆1,234Updated last year
- ☆565Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆760Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆634Updated last year
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆293Updated 2 years ago
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)☆542Updated last year
- Diffusion attentive attribution maps for interpreting Stable Diffusion.☆785Updated last year
- [ICCV 2023] Consistent Image Synthesis and Editing☆828Updated last year
- Implementation of "SVDiff: Compact Parameter Space for Diffusion Fine-Tuning"☆382Updated last year
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis☆320Updated 2 years ago
- Erasing Concepts from Diffusion Models☆645Updated 4 months ago
- [NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models☆665Updated last year
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆480Updated last year
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images☆506Updated 2 months ago
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor☆522Updated last year
- Mixture of Diffusers for scene composition and high resolution image generation☆448Updated 2 years ago
- Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models☆324Updated 2 years ago
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆499Updated last year
- Large-scale text-video dataset. 10 million captioned short videos.☆668Updated last year
- DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual A…☆567Updated last month
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information☆617Updated last year
- Description and pointers of laion datasets☆248Updated 3 years ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,603Updated last month
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆919Updated last year
- [NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".☆393Updated 10 months ago
- Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]☆626Updated last year
- This is an unofficial PyTorch implementation of StyleDrop: Text-to-Image Generation in Any Style.☆225Updated 2 years ago
- NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models☆428Updated last year
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆711Updated 11 months ago
- Unoffical implement for [StyleDrop](https://arxiv.org/abs/2306.00983)☆583Updated 2 years ago