Code for "Diffusion Model Alignment Using Direct Preference Optimization"
☆661Nov 10, 2025Updated 3 months ago
Alternatives and similar repositories for DiffusionDPO
Users that are interested in DiffusionDPO are comparing it to the libraries listed below
Sorting:
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support☆744Mar 22, 2024Updated last year
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆69Aug 16, 2025Updated 6 months ago
- [NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL☆2,007Nov 4, 2025Updated 3 months ago
- [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆379Mar 26, 2025Updated 11 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆428Sep 24, 2025Updated 5 months ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…