[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"
☆38Jul 12, 2024Updated last year
Alternatives and similar repositories for tdpo
Users that are interested in tdpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆107Sep 11, 2024Updated last year
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"☆12Apr 22, 2024Updated 2 years ago
- ☆116Sep 12, 2024Updated last year
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆39May 9, 2024Updated 2 years ago
- Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight☆63Feb 12, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆127Sep 12, 2024Updated last year
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆223May 22, 2026Updated last week
- ☆132Sep 12, 2024Updated last year
- SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all…☆30Jul 18, 2024Updated last year
- ☆116Sep 12, 2024Updated last year
- This is an official implementation for "MMSite: A Multi-modal Framework for the Identification of Active Sites in Proteins".☆18Jan 4, 2025Updated last year
- ☆218Jun 17, 2025Updated 11 months ago
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization☆67Sep 19, 2025Updated 8 months ago
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning☆50Jun 17, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Rewarded soups official implementation☆63Sep 27, 2023Updated 2 years ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆73May 18, 2025Updated last year
- [IEEE TPAMI] Code for the paper "Aligning Few-Step Diffusion Models with Dense Reward Difference Learning"☆20Feb 25, 2026Updated 3 months ago
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆69Aug 16, 2025Updated 9 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- Official repo for [ICLR 2026] "AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs"☆25Feb 28, 2026Updated 3 months ago
- Code for the tutorial/review paper for RL-based-fine-tuniing. In this code, we especially focus on the design of biological sequences li…☆159Sep 15, 2024Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆82Jun 11, 2024Updated last year
- ☆11Dec 15, 2025Updated 5 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆320Nov 1, 2024Updated last year
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆693Nov 10, 2025Updated 6 months ago
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆28Jan 21, 2026Updated 4 months ago
- GenDR: Lightning Generative Detail Restorator☆38Feb 24, 2026Updated 3 months ago
- Derivative-Free Guidance in Diffusion Models with Soft Value-Based Decoding. For controlled generation in DNA, RNA, proteins, molecules (…☆39Jan 7, 2026Updated 4 months ago
- ☆21Jul 6, 2025Updated 10 months ago
- ☆35May 24, 2023Updated 3 years ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆29Oct 30, 2024Updated last year
- Offers spaced-repetition algorithms according to Leitner and SuperMemo-2 and teacher functions to give bonus for repetitive learning.☆14May 4, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆27Feb 25, 2025Updated last year
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 5 years ago
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"☆243Apr 6, 2024Updated 2 years ago
- Lightning-Fast, On-Device TTS — running natively via ONNX.☆56May 18, 2026Updated last week
- python 3 scraper for coinmarketcap historical data☆11May 28, 2018Updated 8 years ago
- Gamebaby Rock Sun's D3D12 DirectX Ray Tracing C-Style Sample for beginner☆18Feb 5, 2023Updated 3 years ago
- Working note for WSI analysis☆10Apr 3, 2023Updated 3 years ago