Schuture / Benchmarking-Awesome-Diffusion-Models
A benchmark of SOTA text-to-image diffusion models using X-IQE, a new benchmarking strategy based on MiniGPT-4.
☆110 Updated last year
Related projects
Alternatives and complementary repositories for Benchmarking-Awesome-Diffusion-Models
- [NeurIPS 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation ☆211 Updated 2 weeks ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis ☆399 Updated 5 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t… ☆105 Updated 4 months ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023 ☆266 Updated last year
- GenEval: An object-focused framework for evaluating text-to-image alignment ☆120 Updated 3 months ago
- Official code of SmartEdit [CVPR-2024 Highlight] ☆254 Updated 5 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models ☆141 Updated last month
- Implementation of "DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning" ☆80 Updated last year
- ☆114 Updated 4 months ago
- [ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code" ☆261 Updated 8 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024) ☆150 Updated last month
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model". ☆100 Updated 4 months ago
- ☆73 Updated 7 months ago
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp… ☆242 Updated 3 weeks ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision ☆111 Updated this week
- ☆93 Updated 6 months ago
- ☆113 Updated 4 months ago
- [CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models ☆207 Updated last month
- The official PyTorch Implementation for ElasticDiffusion: Training-free Arbitrary Size Image Generation (CVPR 2024) ☆153 Updated 7 months ago
- 🚀 Cross attention map tools for huggingface/diffusers ☆153 Updated last week
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities… ☆117 Updated 6 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆84 Updated 4 months ago
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidance ☆240 Updated 8 months ago
- Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step ☆156 Updated 4 months ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning". ☆281 Updated 3 weeks ago
- ☆156 Updated last year
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching ☆137 Updated this week
- ☆94 Updated 11 months ago
- An unofficial implementation of DiffEdit on stable-diffusion ☆69 Updated last year
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆122 Updated 5 months ago