Better Aligning Text-to-Image Models with Human Preference. ICCV 2023
☆294Jul 14, 2023Updated 2 years ago
Alternatives and similar repositories for align_sd
Users that are interested in align_sd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆651May 24, 2024Updated last year
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,650Oct 29, 2025Updated 4 months ago
- ☆582Dec 21, 2024Updated last year
- CLIP+MLP Aesthetic Score Predictor☆1,266Jul 1, 2024Updated last year
- Code for the paper "Training Diffusion Models with Reinforcement Learning"☆558Jul 5, 2023Updated 2 years ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆314Nov 1, 2024Updated last year
- Implementation of "SVDiff: Compact Parameter Space for Diffusion Fine-Tuning"☆384Jan 24, 2024Updated 2 years ago
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆714Jan 10, 2025Updated last year
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support☆750Mar 22, 2024Updated 2 years ago
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"☆243Apr 6, 2024Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆767Jan 26, 2024Updated 2 years ago
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023)☆502Nov 14, 2023Updated 2 years ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".☆298Aug 29, 2025Updated 6 months ago
- This is the official repo for the paper "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles", Tang et…☆195Mar 8, 2023Updated 3 years ago
- Zero-shot Image-to-Image Translation [SIGGRAPH 2023]☆1,145Oct 16, 2024Updated last year
- [TCSVT'24] Offical Implementation of 2AFC-LMMs☆12Aug 17, 2024Updated last year
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆671Nov 10, 2025Updated 4 months ago
- [MM 2024 Oral] Refiner for AIGC☆29Jul 29, 2024Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor☆522Apr 2, 2024Updated last year
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Jul 10, 2023Updated 2 years ago
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion☆1,365Jul 11, 2024Updated last year
- The benchmark of SOTA text-to-image diffusion models with a new benchmarking strategy based on MiniGPT-4, namely X-IQE.☆131Dec 3, 2025Updated 3 months ago
- A linear estimator on top of clip to predict the aesthetic quality of pictures☆676Aug 15, 2022Updated 3 years ago
- Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning☆317Jul 11, 2024Updated last year
- [CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu…☆14Jun 14, 2024Updated last year
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,476May 31, 2023Updated 2 years ago
- TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering☆182Apr 29, 2024Updated last year
- T2I-Adapter☆3,803Jun 21, 2024Updated last year
- [ICCV 2023] Consistent Image Synthesis and Editing☆843Aug 19, 2024Updated last year
- [WIP@Oct 13] 质衡-基准测试 (Q-Bench in Chinese),包含中文版【底层视觉问答】和【底层视觉描述】数据集,以及中文提示下的图片质量评价。 We will release Q-Bench in more languages in the futu…☆24Jan 7, 2024Updated 2 years ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]☆178Dec 2, 2025Updated 3 months ago
- [NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models☆668Jul 17, 2024Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,281Oct 31, 2024Updated last year
- Collections of papers and code for employing MLLM for quality assessment tasks.☆13Apr 18, 2024Updated last year
- Open-Set Grounded Text-to-Image Generation☆2,212Mar 6, 2024Updated 2 years ago
- Official PyTorch codes for "Enhancing Diffusion Models with Text-Encoder Reinforcement Learning", ECCV2024☆57Aug 13, 2024Updated last year
- (ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.☆2,429Feb 7, 2026Updated last month
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆480Sep 9, 2024Updated last year