[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"
☆38Jul 12, 2024Updated last year
Alternatives and similar repositories for tdpo
Users that are interested in tdpo are comparing it to the libraries listed below
Sorting:
- The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs☆27Jan 15, 2025Updated last year
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆40May 9, 2024Updated last year
- Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight☆59Feb 12, 2025Updated last year
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆20Dec 10, 2024Updated last year
- ☆23Oct 20, 2023Updated 2 years ago
- SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all…☆30Jul 18, 2024Updated last year
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning☆51Jun 17, 2025Updated 8 months ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆205Mar 1, 2026Updated last week
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69May 18, 2025Updated 9 months ago
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization☆61Sep 19, 2025Updated 5 months ago
- Official repo for [ICLR 2026] "AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs"☆22Feb 28, 2026Updated last week
- Official repo for "TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series"☆28May 14, 2025Updated 9 months ago
- ☆17Feb 22, 2024Updated 2 years ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆82Jun 11, 2024Updated last year
- [ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization☆20May 24, 2025Updated 9 months ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Jun 28, 2024Updated last year
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆152Feb 14, 2025Updated last year
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆314Nov 1, 2024Updated last year
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆69Aug 16, 2025Updated 6 months ago
- GenDR: Lightning Generative Detail Restorator☆35Feb 24, 2026Updated last week
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- Code for "Optimizing DDPM Sampling with Shortcut Fine-Tuning" (https://arxiv.org/abs/2301.13362), ICML 2023☆30Oct 6, 2023Updated 2 years ago
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support☆746Mar 22, 2024Updated last year
- [EMNLP2024] Benchmark for "Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark"☆36Sep 18, 2025Updated 5 months ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆667Nov 10, 2025Updated 3 months ago
- Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM☆79Apr 19, 2025Updated 10 months ago
- Implementation of Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models☆78Oct 23, 2023Updated 2 years ago
- [ACL 2025] The official pytorch implement of "MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection".☆25May 26, 2025Updated 9 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆130Nov 18, 2024Updated last year
- ☆26Apr 27, 2025Updated 10 months ago
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024☆35Oct 31, 2025Updated 4 months ago
- Source code of PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications.☆33May 16, 2025Updated 9 months ago
- This the implementation of LeCo☆31Jan 20, 2025Updated last year
- [ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization☆30Apr 7, 2023Updated 2 years ago
- ☆42Nov 13, 2024Updated last year
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆84Dec 5, 2024Updated last year
- [NeurIPS 2025🔥:] EVODiff is an inference-time refinement method for diffusion models that improves sampling efficiency and generative f…☆29Feb 2, 2026Updated last month
- ☆82Nov 25, 2024Updated last year