tgxs002/HPSv2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tgxs002/HPSv2)

tgxs002 / HPSv2

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

☆677

Alternatives and similar repositories for HPSv2

Users that are interested in HPSv2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yuvalkirstain / PickScore
View on GitHub
☆601Dec 21, 2024Updated last year
zai-org / ImageReward
View on GitHub
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
☆1,694Oct 29, 2025Updated 8 months ago
tgxs002 / align_sd
View on GitHub
Better Aligning Text-to-Image Models with Human Preference. ICCV 2023
☆293Jul 14, 2023Updated 3 years ago
Kwai-Kolors / MPS
View on GitHub
☆206Jul 12, 2024Updated 2 years ago
mihirp1998 / AlignProp
View on GitHub
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…
☆324Nov 1, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
christophschuhmann / improved-aesthetic-predictor
View on GitHub
CLIP+MLP Aesthetic Score Predictor
☆1,328Jul 1, 2024Updated 2 years ago
MizzenAI / HPSv3
View on GitHub
Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)
☆328Dec 5, 2025Updated 7 months ago
SalesforceAIResearch / DiffusionDPO
View on GitHub
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
☆705Jun 2, 2026Updated last month
zai-org / VisionReward
View on GitHub
[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
☆422Mar 26, 2025Updated last year
yk7333 / d3po
View on GitHub
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
☆244Apr 6, 2024Updated 2 years ago
google-research-datasets / richhf-18k
View on GitHub
RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…
☆157Jun 25, 2024Updated 2 years ago
djghosh13 / geneval
View on GitHub
GenEval: An object-focused framework for evaluating text-to-image alignment
☆472Mar 3, 2025Updated last year
RockeyCoss / SPO
View on GitHub
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
☆271Apr 7, 2025Updated last year
tianweiy / DMD2
View on GitHub
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
☆1,410Mar 5, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mihirp1998 / VADER
View on GitHub
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…
☆315Mar 12, 2025Updated last year
Shentao-YANG / Dense_Reward_T2I
View on GitHub
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
☆39May 9, 2024Updated 2 years ago
kvablack / ddpo-pytorch
View on GitHub
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
☆768Mar 22, 2024Updated 2 years ago
PixArt-alpha / PixArt-alpha
View on GitHub
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,300Oct 31, 2024Updated last year
CaraJ7 / CoMat
View on GitHub
[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
☆169Nov 18, 2024Updated last year
LAION-AI / aesthetic-predictor
View on GitHub
A linear estimator on top of clip to predict the aesthetic quality of pictures
☆726Aug 15, 2022Updated 3 years ago
NVlabs / DiffusionNFT
View on GitHub
[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process
☆985Feb 10, 2026Updated 5 months ago
Karine-Huang / T2I-CompBench
View on GitHub
[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
☆345May 7, 2026Updated 2 months ago
yifan123 / flow_grpo
View on GitHub
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
☆2,430May 7, 2026Updated 2 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Vchitect / VBench
View on GitHub
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆1,706Mar 23, 2026Updated 4 months ago
jannerm / ddpo
View on GitHub
Code for the paper "Training Diffusion Models with Reinforcement Learning"
☆573Jul 5, 2023Updated 3 years ago
CodeGoat24 / UnifiedReward
View on GitHub
Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex
☆796Jun 18, 2026Updated last month
XueZeyue / DanceGRPO
View on GitHub
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
☆1,641Oct 16, 2025Updated 9 months ago
xie-lab-ml / awesome-alignment-of-diffusion-models
View on GitHub
[ACM Computing Surveys] The collection of awesome papers on alignment of diffusion models.
☆431Feb 6, 2026Updated 5 months ago
linzhiqiu / t2v_metrics
View on GitHub
Evaluating text-to-image/video/3D models with VQAScore
☆597Jun 5, 2026Updated last month
mapo-t2i / mapo
View on GitHub
Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).
☆83Jun 11, 2024Updated 2 years ago
Yushi-Hu / tifa
View on GitHub
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
☆186Apr 29, 2024Updated 2 years ago
Alpha-VLLM / Lumina-T2X
View on GitHub
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,247Feb 16, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
j-min / DSG
View on GitHub
Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)
☆109Dec 9, 2024Updated last year
TencentQQGYLab / ELLA
View on GitHub
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
☆1,285Jul 17, 2024Updated 2 years ago
sihyun-yu / REPA
View on GitHub
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,680Mar 16, 2025Updated last year
ExplainableML / ReNO
View on GitHub
[NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
☆166Sep 15, 2025Updated 10 months ago
Q-Future / Q-Align
View on GitHub
③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.
☆611Jun 24, 2026Updated last month
YangLing0818 / RPG-DiffusionMaster
View on GitHub
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
☆1,839Feb 1, 2025Updated last year
FoundationVision / LlamaGen
View on GitHub
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,960Aug 15, 2024Updated last year