ZiyiZhang27/tdpo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZiyiZhang27/tdpo)

ZiyiZhang27 / tdpo

[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"

☆38

Alternatives and similar repositories for tdpo

Users that are interested in tdpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kvablack / LLaVA-server
View on GitHub
☆23Oct 20, 2023Updated 2 years ago
krafton-ai / DAS
View on GitHub
Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight
☆65Feb 12, 2025Updated last year
zhaoyl18 / SEIKO
View on GitHub
SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all…
☆30Jul 18, 2024Updated last year
Gift-OYS / MMSite
View on GitHub
This is an official implementation for "MMSite: A Multi-modal Framework for the Identification of Active Sites in Proteins".
☆18Jan 4, 2025Updated last year
Kwai-Kolors / LPO
View on GitHub
Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization
☆68Sep 19, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ZhenglinZhou / DreamDPO
View on GitHub
[ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
☆21May 24, 2025Updated last year
wfanyue / DPG-T2I-Personalization
View on GitHub
[ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning
☆50Jun 17, 2025Updated last year
ZiyiZhang27 / sdpo
View on GitHub
[IEEE TPAMI] Code for the paper "Aligning Few-Step Diffusion Models with Dense Reward Difference Learning"
☆22Feb 25, 2026Updated 4 months ago
jacklishufan / diffusion-kto
View on GitHub
The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility
☆69Aug 16, 2025Updated 10 months ago
NJUDeepEngine / CAEF
View on GitHub
Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"
☆11Oct 11, 2024Updated last year
MiliLab / AnesSuite
View on GitHub
Official repo for [ICLR 2026] "AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs"
☆25Feb 28, 2026Updated 4 months ago
masa-ue / RLfinetuning_Diffusion_Bioseq
View on GitHub
Code for the tutorial/review paper for RL-based-fine-tuniing. In this code, we especially focus on the design of biological sequences li…
☆159Sep 15, 2024Updated last year
UW-Madison-Lee-Lab / SFT-PG
View on GitHub
Code for "Optimizing DDPM Sampling with Shortcut Fine-Tuning" (https://arxiv.org/abs/2301.13362), ICML 2023
☆30Oct 6, 2023Updated 2 years ago
mapo-t2i / mapo
View on GitHub
Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).
☆82Jun 11, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
mihirp1998 / AlignProp
View on GitHub
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…
☆322Nov 1, 2024Updated last year
SalesforceAIResearch / DiffusionDPO
View on GitHub
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
☆702Jun 2, 2026Updated last month
xiaolul2 / Interp3D
View on GitHub
[ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."
☆31Jan 21, 2026Updated 5 months ago
MinkaiXu / fPO
View on GitHub
f-PO: Generalizing Preference Optimization with f-divergence Minimization
☆14Apr 2, 2025Updated last year
icandle / GenDR
View on GitHub
GenDR: Lightning Generative Detail Restorator
☆38Feb 24, 2026Updated 4 months ago
sungyeonparkk / NuPlanQA
View on GitHub
☆20Jul 6, 2025Updated last year
ldcq / ldcq
View on GitHub
☆35May 24, 2023Updated 3 years ago
ChocoWu / SeTok
View on GitHub
Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM
☆81Apr 19, 2025Updated last year
sail-sg / Rigging-ChatbotArena
View on GitHub
Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)
☆27Feb 25, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 5 years ago
yk7333 / d3po
View on GitHub
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
☆244Apr 6, 2024Updated 2 years ago
rolandwonglonam / claude-english-immersion
View on GitHub
Claude Code skills for passive English immersion + PTE exam prep. Turn daily AI conversations into language learning.
☆65Apr 19, 2026Updated 2 months ago
zihou98 / Whole-Slide-Image
View on GitHub
Working note for WSI analysis
☆10Apr 3, 2023Updated 3 years ago
zhiyuns / UNITPathSSL
View on GitHub
Official PyTorch implementation of the TMI paper "Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for…
☆16Mar 13, 2024Updated 2 years ago
csmile-1006 / REDS_agent
View on GitHub
Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)
☆19Apr 11, 2025Updated last year
tgxs002 / HPSv2
View on GitHub
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
☆676May 24, 2024Updated 2 years ago
Sun1992 / HPUN
View on GitHub
☆12Dec 9, 2022Updated 3 years ago
woojeongjin / FewVLM
View on GitHub
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)
☆42May 13, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
supertone-inc / supertonic-py
View on GitHub
Lightning-Fast, On-Device TTS — running natively via ONNX.
☆82May 18, 2026Updated last month
j-min / VPGen
View on GitHub
Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆57Jul 25, 2023Updated 2 years ago
MikaStars39 / StableMask
View on GitHub
PyTorch implementation of StableMask (ICML'24)
☆15Jun 27, 2024Updated 2 years ago
RM-R1-UIUC / RM-R1
View on GitHub
[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models
☆165Jun 26, 2025Updated last year
dbsxodud-11 / PAG
View on GitHub
Official Code for Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation (CVPR 2025)
☆15Apr 2, 2025Updated last year
krafton-ai / Rare-to-Frequent
View on GitHub
Rare-to-Frequent (R2F), ICLR'25, Spotlight
☆53Apr 23, 2025Updated last year
CRIPAC-DIG / SCGAN
View on GitHub
[ICME 2019] Source code and datasets for "Semi-supervised Compatibility Learning Across Categories for Clothing Matching"
☆11Apr 26, 2024Updated 2 years ago