yk7333/d3po

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yk7333/d3po)

yk7333 / d3po

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

☆244

Alternatives and similar repositories for d3po

Users that are interested in d3po are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Shentao-YANG / Dense_Reward_T2I
View on GitHub
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
☆39May 9, 2024Updated 2 years ago
kvablack / ddpo-pytorch
View on GitHub
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
☆768Mar 22, 2024Updated 2 years ago
RockeyCoss / SPO
View on GitHub
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
☆271Apr 7, 2025Updated last year
mapo-t2i / mapo
View on GitHub
Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).
☆83Jun 11, 2024Updated 2 years ago
SalesforceAIResearch / DiffusionDPO
View on GitHub
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
☆706Jun 2, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jacklishufan / diffusion-kto
View on GitHub
The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility
☆69Aug 16, 2025Updated 11 months ago
tgxs002 / HPSv2
View on GitHub
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
☆677May 24, 2024Updated 2 years ago
mihirp1998 / AlignProp
View on GitHub
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…
☆324Nov 1, 2024Updated last year
jannerm / ddpo
View on GitHub
Code for the paper "Training Diffusion Models with Reinforcement Learning"
☆574Jul 5, 2023Updated 3 years ago
zai-org / ImageReward
View on GitHub
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
☆1,695Oct 29, 2025Updated 9 months ago
yuvalkirstain / PickScore
View on GitHub
☆601Dec 21, 2024Updated last year
yigu1008 / Diffusion-RPO
View on GitHub
☆15Mar 30, 2025Updated last year
mihirp1998 / VADER
View on GitHub
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…
☆315Mar 12, 2025Updated last year
zai-org / VisionReward
View on GitHub
[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
☆422Mar 26, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
google-research-datasets / richhf-18k
View on GitHub
RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…
☆157Jun 25, 2024Updated 2 years ago
kvablack / LLaVA-server
View on GitHub
☆22Oct 20, 2023Updated 2 years ago
tgxs002 / align_sd
View on GitHub
Better Aligning Text-to-Image Models with Human Preference. ICCV 2023
☆293Jul 14, 2023Updated 3 years ago
qqingzheng / AI-Self-Training-DPO-SDXL
View on GitHub
Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.
☆66Feb 24, 2024Updated 2 years ago
hu-zijing / B2-DiffuRL
View on GitHub
[CVPR 25] A framework named B^2-DiffuRL for RL-based diffusion model fine-tuning.
☆57Mar 31, 2025Updated last year
louisYen / Gen4Gen
View on GitHub
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
☆110Mar 27, 2026Updated 4 months ago
Kwai-Kolors / MPS
View on GitHub
☆206Jul 12, 2024Updated 2 years ago
pinterest / atg-research
View on GitHub
☆74Sep 23, 2025Updated 10 months ago
Hannieliao / Baton
View on GitHub
Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"
☆32Mar 4, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
xie-lab-ml / awesome-alignment-of-diffusion-models
View on GitHub
[ACM Computing Surveys] The collection of awesome papers on alignment of diffusion models.
☆430Feb 6, 2026Updated 5 months ago
christophschuhmann / improved-aesthetic-predictor
View on GitHub
CLIP+MLP Aesthetic Score Predictor
☆1,328Jul 1, 2024Updated 2 years ago
Owen-Oertell / rlcm
View on GitHub
☆58Sep 23, 2024Updated last year
ZiyuGuo99 / Image-Generation-CoT
View on GitHub
[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation
☆865Mar 19, 2026Updated 4 months ago
Mowenyii / PAE
View on GitHub
[CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation
☆87Jul 13, 2024Updated 2 years ago
CodeGoat24 / UnifiedReward
View on GitHub
Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex
☆796Jun 18, 2026Updated last month
invictus717 / InteractiveVideo
View on GitHub
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
☆133Feb 7, 2024Updated 2 years ago
discus0434 / aesthetic-predictor-v2-5
View on GitHub
SigLIP-based Aesthetic Score Predictor
☆426Dec 18, 2024Updated last year
tianweiy / DMD2
View on GitHub
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
☆1,415Mar 5, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
mlpc-ucsd / TokenCompose
View on GitHub
(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
☆137Dec 21, 2024Updated last year
G-U-N / Diffusion-NPO
View on GitHub
[ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generati…
☆39Jan 26, 2026Updated 6 months ago
yifan123 / flow_grpo
View on GitHub
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
☆2,437May 7, 2026Updated 2 months ago
CaraJ7 / CoMat
View on GitHub
[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
☆169Nov 18, 2024Updated last year
KlingAIResearch / VideoAlign
View on GitHub
[NeurIPS 2025] Improving Video Generation with Human Feedback
☆489Sep 24, 2025Updated 10 months ago
XueZeyue / DanceGRPO
View on GitHub
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
☆1,642Oct 16, 2025Updated 9 months ago
cientgu / InstructDiffusion
View on GitHub
PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.
☆445May 14, 2024Updated 2 years ago