christophschuhmann / improved-aesthetic-predictorLinks
CLIP+MLP Aesthetic Score Predictor
☆1,247Updated last year
Alternatives and similar repositories for improved-aesthetic-predictor
Users that are interested in improved-aesthetic-predictor are comparing it to the libraries listed below
Sorting:
- A linear estimator on top of clip to predict the aesthetic quality of pictures☆660Updated 3 years ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,626Updated 3 months ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆642Updated last year
- ☆576Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆763Updated 2 years ago
- Diffusion attentive attribution maps for interpreting Stable Diffusion.☆785Updated last year
- [ICCV 2023] Consistent Image Synthesis and Editing☆836Updated last year
- Paint by Example: Exemplar-based Image Editing with Diffusion Models☆1,242Updated 2 years ago
- [NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models☆665Updated last year
- Transfer the ControlNet with any basemodel in diffusers🔥☆846Updated 2 years ago
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information☆620Updated last year
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆1,056Updated 2 years ago
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆920Updated last year
- The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥☆824Updated last year
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆712Updated last year
- A prompting enhancement library for transformers-type text embedding systems☆604Updated 2 months ago
- Unified Controllable Visual Generation Model☆657Updated last year
- Erasing Concepts from Diffusion Models☆654Updated 5 months ago
- A collection of resources on controllable generation with text-to-image diffusion models.☆1,110Updated last year
- ICLR 2024 (Spotlight)☆783Updated last year
- ☆474Updated 7 months ago
- ☆482Updated 8 months ago
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)☆543Updated 2 years ago
- Large-scale text-video dataset. 10 million captioned short videos.☆674Updated last year
- Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]☆629Updated last year
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆481Updated last year
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor☆521Updated last year
- Unoffical implement for [StyleDrop](https://arxiv.org/abs/2306.00983)☆585Updated 2 years ago
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆500Updated last year
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆294Updated 2 years ago