ZiyiZhang27 / tdpo
[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"
☆38Jul 12, 2024Updated last year
Alternatives and similar repositories for tdpo
Users that are interested in tdpo are comparing it to the libraries listed below
- The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs☆27Jan 15, 2025Updated last year
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆40May 9, 2024Updated last year
- Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight☆59Feb 12, 2025Updated last year
- ☆23Oct 20, 2023Updated 2 years ago
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆21Dec 10, 2024Updated last year
- SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all…☆29Jul 18, 2024Updated last year
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning☆51Jun 17, 2025Updated 8 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69May 18, 2025Updated 8 months ago
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization☆61Sep 19, 2025Updated 4 months ago
- Official repo for "AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs"☆22Jan 18, 2026Updated 3 weeks ago
- Official repo for "TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series"☆28May 14, 2025Updated 9 months ago
- ☆17Feb 22, 2024Updated last year
- [IEEE TPAMI] Code for the paper "Aligning Few-Step Diffusion Models with Dense Reward Difference Learning"☆19Feb 8, 2026Updated last week
- [ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization☆20May 24, 2025Updated 8 months ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Jun 28, 2024Updated last year
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models☆17Jul 17, 2024Updated last year
- GenDR: Lightning Generative Detail Restorator☆31Mar 13, 2025Updated 11 months ago
- Code for the tutorial/review paper for RL-based fine-tuning. In this code, we especially focus on the design of biological sequences li…☆157Sep 15, 2024Updated last year
- Reference implementation for Token-level Direct Preference Optimization (TDPO)☆151Feb 14, 2025Updated last year
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆311Nov 1, 2024Updated last year
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆69Aug 16, 2025Updated 6 months ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support☆741Mar 22, 2024Updated last year
- ☆37Oct 15, 2024Updated last year
- [EMNLP2024] Benchmark for "Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark"☆35Sep 18, 2025Updated 4 months ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆661Nov 10, 2025Updated 3 months ago
- Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM☆79Apr 19, 2025Updated 9 months ago
- [ACL 2025] The official PyTorch implementation of "MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection".☆26May 26, 2025Updated 8 months ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆129Aug 21, 2024Updated last year
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024☆35Oct 31, 2025Updated 3 months ago
- Source code of PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications.☆33May 16, 2025Updated 9 months ago
- ☆32Sep 12, 2024Updated last year
- This is the implementation of LeCo☆31Jan 20, 2025Updated last year
- ☆42Nov 13, 2024Updated last year
- [ECCV 2022] Grounding Visual Representations with Texts for Domain Generalization☆31Apr 7, 2023Updated 2 years ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆83Dec 5, 2024Updated last year
- Codes for Arctic river segmentation using various fully convolutional neural networks.☆10Dec 27, 2022Updated 3 years ago
- Dynamic, high-resolution poverty measurement in data-scarce environments☆10Dec 8, 2024Updated last year