christophschuhmann / improved-aesthetic-predictorLinks
CLIP+MLP Aesthetic Score Predictor
☆1,238Updated last year
Alternatives and similar repositories for improved-aesthetic-predictor
Users that are interested in improved-aesthetic-predictor are comparing it to the libraries listed below
Sorting:
- A linear estimator on top of clip to predict the aesthetic quality of pictures☆649Updated 3 years ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,614Updated 2 months ago
- ☆567Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆636Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆759Updated last year
- [NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models☆665Updated last year
- Diffusion attentive attribution maps for interpreting Stable Diffusion.☆785Updated last year
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information☆619Updated last year
- Erasing Concepts from Diffusion Models☆649Updated 4 months ago
- [ICCV 2023] Consistent Image Synthesis and Editing☆832Updated last year
- Unified Controllable Visual Generation Model☆657Updated 11 months ago
- Paint by Example: Exemplar-based Image Editing with Diffusion Models☆1,240Updated 2 years ago
- Transfer the ControlNet with any basemodel in diffusers🔥☆847Updated 2 years ago
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆1,053Updated 2 years ago
- The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥☆823Updated last year
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆921Updated last year
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆711Updated 11 months ago
- A prompting enhancement library for transformers-type text embedding systems☆602Updated 2 months ago
- ☆475Updated 6 months ago
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆480Updated last year
- Unoffical implement for [StyleDrop](https://arxiv.org/abs/2306.00983)☆583Updated 2 years ago
- ☆479Updated 8 months ago
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)☆542Updated 2 years ago
- A collection of resources on controllable generation with text-to-image diffusion models.☆1,098Updated last year
- ICLR 2024 (Spotlight)☆781Updated last year
- Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]☆627Updated last year
- Large-scale text-video dataset. 10 million captioned short videos.☆670Updated last year
- Description and pointers of laion datasets☆248Updated 3 years ago
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor☆522Updated last year
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆757Updated 2 years ago