[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"
☆38Jul 12, 2024Updated last year
Alternatives and similar repositories for tdpo
Users that are interested in tdpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆40May 9, 2024Updated last year
- ☆23Oct 20, 2023Updated 2 years ago
- Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight☆60Feb 12, 2025Updated last year
- SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all…☆30Jul 18, 2024Updated last year
- This is an official implementation for "MMSite: A Multi-modal Framework for the Identification of Active Sites in Proteins".☆18Jan 4, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning☆51Jun 17, 2025Updated 10 months ago
- Rewarded soups official implementation☆63Sep 27, 2023Updated 2 years ago
- [IEEE TPAMI] Code for the paper "Aligning Few-Step Diffusion Models with Dense Reward Difference Learning"☆20Feb 25, 2026Updated last month
- Official repo for "TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series"☆28May 14, 2025Updated 11 months ago
- Official repo for [ICLR 2026] "AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs"☆22Feb 28, 2026Updated last month
- Code for "Optimizing DDPM Sampling with Shortcut Fine-Tuning" (https://arxiv.org/abs/2301.13362), ICML 2023☆30Oct 6, 2023Updated 2 years ago
- ☆11Dec 15, 2025Updated 4 months ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆318Nov 1, 2024Updated last year
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆682Nov 10, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re…☆39Sep 22, 2024Updated last year
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆26Jan 21, 2026Updated 2 months ago
- GenDR: Lightning Generative Detail Restorator☆36Feb 24, 2026Updated last month
- ☆36May 24, 2023Updated 2 years ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆29Oct 30, 2024Updated last year
- Code for Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization.☆10Sep 28, 2021Updated 4 years ago
- Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM☆81Apr 19, 2025Updated last year
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆21Dec 10, 2024Updated last year
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"☆244Apr 6, 2024Updated 2 years ago
- Working note for WSI analysis☆10Apr 3, 2023Updated 3 years ago
- Developer project for getting basic API integrations working in under 5 minutes☆11Jan 30, 2026Updated 2 months ago
- ☆59Sep 23, 2024Updated last year
- Official PyTorch implementation of the TMI paper "Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for…☆16Mar 13, 2024Updated 2 years ago
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)☆18Apr 11, 2025Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆661May 24, 2024Updated last year
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)☆42May 13, 2022Updated 3 years ago
- [ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization☆30Apr 7, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆17Mar 10, 2025Updated last year
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆57Jul 25, 2023Updated 2 years ago
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated last year
- ☆20May 16, 2024Updated last year
- Layerwise Batch Entropy Regularization☆24Aug 3, 2022Updated 3 years ago
- ☆13Jun 5, 2024Updated last year
- Official Code for Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation (CVPR 2025)☆14Apr 2, 2025Updated last year