[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"
☆38Jul 12, 2024Updated last year
Alternatives and similar repositories for tdpo
Users that are interested in tdpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs☆27Jan 15, 2025Updated last year
- ☆136Sep 12, 2024Updated last year
- ☆110Sep 12, 2024Updated last year
- ☆23Oct 20, 2023Updated 2 years ago
- ☆121Sep 11, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆129Sep 12, 2024Updated last year
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆207Updated this week
- SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all…☆30Jul 18, 2024Updated last year
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization☆63Sep 19, 2025Updated 6 months ago
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning☆51Jun 17, 2025Updated 9 months ago
- Rewarded soups official implementation☆62Sep 27, 2023Updated 2 years ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆70May 18, 2025Updated 10 months ago
- Official repo for "TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series"☆28May 14, 2025Updated 10 months ago
- Official repo for [ICLR 2026] "AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs"☆22Feb 28, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆69Aug 16, 2025Updated 7 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support☆749Mar 22, 2024Updated 2 years ago
- Code for the tutorial/review paper for RL-based-fine-tuniing. In this code, we especially focus on the design of biological sequences li…☆158Sep 15, 2024Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆82Jun 11, 2024Updated last year
- ☆11Dec 15, 2025Updated 3 months ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆314Nov 1, 2024Updated last year
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆26Jan 21, 2026Updated 2 months ago
- GenDR: Lightning Generative Detail Restorator☆36Feb 24, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Dreamfusion with Stable diffusion backend☆10Oct 4, 2022Updated 3 years ago
- Clue-RAG: Towards Accurate and Cost-Efficient Graph-based RAG via Multi-Partite Graph and Query-Driven Iterative Retrieval☆23Mar 3, 2026Updated 3 weeks ago
- Code for Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization.☆10Sep 28, 2021Updated 4 years ago
- ☆11Feb 26, 2024Updated 2 years ago
- Offers spaced-repetition algorithms according to Leitner and SuperMemo-2 and teacher functions to give bonus for repetitive learning.☆14May 4, 2022Updated 3 years ago
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"☆243Apr 6, 2024Updated last year
- python 3 scraper for coinmarketcap historical data☆11May 28, 2018Updated 7 years ago
- Gamebaby Rock Sun's D3D12 DirectX Ray Tracing C-Style Sample for beginner☆17Feb 5, 2023Updated 3 years ago
- Working note for WSI analysis☆10Apr 3, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Developer project for getting basic API integrations working in under 5 minutes☆11Jan 30, 2026Updated 2 months ago
- ☆13Aug 14, 2022Updated 3 years ago
- Official PyTorch implementation of the TMI paper "Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for…☆16Mar 13, 2024Updated 2 years ago
- ☆59Sep 23, 2024Updated last year
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)☆18Apr 11, 2025Updated 11 months ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆654May 24, 2024Updated last year
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)☆42May 13, 2022Updated 3 years ago