christophschuhmann / improved-aesthetic-predictorLinks
CLIP+MLP Aesthetic Score Predictor
☆1,174Updated last year
Alternatives and similar repositories for improved-aesthetic-predictor
Users that are interested in improved-aesthetic-predictor are comparing it to the libraries listed below
Sorting:
- A linear estimator on top of clip to predict the aesthetic quality of pictures☆591Updated 3 years ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,519Updated 7 months ago
- ☆546Updated 9 months ago
- Diffusion attentive attribution maps for interpreting Stable Diffusion.☆773Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆578Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆749Updated last year
- Transfer the ControlNet with any basemodel in diffusers🔥☆842Updated 2 years ago
- [NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models☆655Updated last year
- [ICCV 2023] Consistent Image Synthesis and Editing☆810Updated last year
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information☆608Updated last year
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆1,042Updated 2 years ago
- Paint by Example: Exemplar-based Image Editing with Diffusion Models☆1,216Updated last year
- Erasing Concepts from Diffusion Models☆634Updated last month
- The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥☆821Updated last year
- Unified Controllable Visual Generation Model☆648Updated 7 months ago
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆476Updated last year
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆706Updated 8 months ago
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)☆540Updated last year
- ☆472Updated 3 months ago
- ICLR 2024 (Spotlight)☆774Updated last year
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆758Updated last year
- Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]☆617Updated last year
- ☆461Updated 4 months ago
- A prompting enhancement library for transformers-type text embedding systems☆583Updated 4 months ago
- Unoffical implement for [StyleDrop](https://arxiv.org/abs/2306.00983)☆583Updated 2 years ago
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor☆520Updated last year
- A collection of resources on controllable generation with text-to-image diffusion models.☆1,077Updated 8 months ago
- Open-Set Grounded Text-to-Image Generation☆2,157Updated last year
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆911Updated last year
- Large-scale text-video dataset. 10 million captioned short videos.☆657Updated last year