SalesforceAIResearch/DiffusionDPO

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

☆704

Alternatives and similar repositories for DiffusionDPO

Users who are interested in DiffusionDPO are comparing it to the libraries listed below.

jacklishufan / diffusion-kto
The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility
☆69 · Aug 16, 2025 · Updated 11 months ago
yifan123 / flow_grpo
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
☆2,420 · May 7, 2026 · Updated 2 months ago
kvablack / ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
☆768 · Mar 22, 2024 · Updated 2 years ago
zai-org / VisionReward
[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
☆421 · Mar 26, 2025 · Updated last year
mihirp1998 / AlignProp
AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…
☆324 · Nov 1, 2024 · Updated last year
yk7333 / d3po
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
☆244 · Apr 6, 2024 · Updated 2 years ago
zai-org / ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
☆1,694 · Oct 29, 2025 · Updated 8 months ago
KlingAIResearch / VideoAlign
[NeurIPS 2025] Improving Video Generation with Human Feedback
☆483 · Sep 24, 2025 · Updated 9 months ago
tgxs002 / HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
☆677 · May 24, 2024 · Updated 2 years ago
RockeyCoss / SPO
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
☆271 · Apr 7, 2025 · Updated last year
XueZeyue / DanceGRPO
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
☆1,635 · Oct 16, 2025 · Updated 9 months ago
jannerm / ddpo
Code for the paper "Training Diffusion Models with Reinforcement Learning"
☆573 · Jul 5, 2023 · Updated 3 years ago
CIntellifusion / VideoDPO
Official Implementation of VideoDPO
☆169 · Jun 1, 2025 · Updated last year
xie-lab-ml / awesome-alignment-of-diffusion-models
[ACM Computing Surveys] The collection of awesome papers on alignment of diffusion models.
☆431 · Feb 6, 2026 · Updated 5 months ago
yuvalkirstain / PickScore
☆600 · Dec 21, 2024 · Updated last year
Shentao-YANG / Dense_Reward_T2I
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
☆39 · May 9, 2024 · Updated 2 years ago
CodeGoat24 / UnifiedReward
Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex
☆796 · Jun 18, 2026 · Updated last month
NVlabs / DiffusionNFT
[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process
☆974 · Feb 10, 2026 · Updated 5 months ago
google-research-datasets / richhf-18k
RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…
☆157 · Jun 25, 2024 · Updated 2 years ago
mapo-t2i / mapo
Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).
☆83 · Jun 11, 2024 · Updated 2 years ago
mihirp1998 / VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…
☆315 · Mar 12, 2025 · Updated last year
tianweiy / DMD2
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
☆1,402 · Mar 5, 2025 · Updated last year
TIGER-AI-Lab / VideoScore
official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]
☆121 · Dec 4, 2025 · Updated 7 months ago
christophschuhmann / improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
☆1,323 · Jul 1, 2024 · Updated 2 years ago
MizzenAI / HPSv3
Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)
☆325 · Dec 5, 2025 · Updated 7 months ago
PixArt-alpha / PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,299 · Oct 31, 2024 · Updated last year
ZiyuGuo99 / Image-Generation-CoT
[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation
☆865 · Mar 19, 2026 · Updated 4 months ago
djghosh13 / geneval
GenEval: An object-focused framework for evaluating text-to-image alignment
☆470 · Mar 3, 2025 · Updated last year
Vchitect / VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆1,700 · Mar 23, 2026 · Updated 3 months ago
VectorSpaceLab / EditScore
[ICLR 2026] EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling
☆253 · Mar 20, 2026 · Updated 4 months ago
gnobitab / RectifiedFlow
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
☆1,630 · Jul 20, 2024 · Updated 2 years ago
CodeGoat24 / Pref-GRPO
Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
☆274 · Feb 10, 2026 · Updated 5 months ago
willisma / SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
☆1,187 · Dec 22, 2025 · Updated 6 months ago
sihyun-yu / REPA
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,679 · Mar 16, 2025 · Updated last year
FoundationVision / LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,959 · Aug 15, 2024 · Updated last year
PKU-YuanGroup / Edit-R1
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
☆294 · Jan 24, 2026 · Updated 5 months ago
xiefan-guo / initno
[CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
☆80 · Jun 7, 2024 · Updated 2 years ago
Yuanshi9815 / OminiControl
[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer
☆1,925 · Jul 2, 2026 · Updated 2 weeks ago
Alpha-VLLM / Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,247 · Feb 16, 2025 · Updated last year