[ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google
☆62Aug 14, 2024Updated last year
Alternatives and similar repositories for dpo-diffusion
Users that are interested in dpo-diffusion are comparing it to the libraries listed below.
- [IEEE TPAMI] Code for the paper "Aligning Few-Step Diffusion Models with Dense Reward Difference Learning"☆20Feb 25, 2026Updated 2 months ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆688Nov 10, 2025Updated 5 months ago
- [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization☆166Sep 15, 2025Updated 7 months ago
- ☆46Mar 29, 2026Updated last month
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- An implementation for MLLM oversensitivity evaluation☆18Nov 16, 2024Updated last year
- Official repository of the paper InstructBrush: Learning Attention-based Instruction Optimization for Image Editing☆16Apr 14, 2024Updated 2 years ago
- ☆13Jan 14, 2026Updated 3 months ago
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆319Nov 1, 2024Updated last year
- ☆191Oct 28, 2024Updated last year
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,667Oct 29, 2025Updated 6 months ago
- Official PyTorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.☆367Jul 11, 2024Updated last year
- [WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing☆17Mar 16, 2025Updated last year
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆40May 9, 2024Updated 2 years ago
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing☆26Aug 23, 2025Updated 8 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆314Mar 12, 2025Updated last year
- Blending Custom Photos with Video Diffusion Transformers☆50Jan 21, 2025Updated last year
- [NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro…☆119Feb 10, 2026Updated 2 months ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆137Dec 21, 2024Updated last year
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆169Nov 18, 2024Updated last year
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"☆83Aug 25, 2025Updated 8 months ago
- ☆17Aug 13, 2024Updated last year
- Source code of paper "Adapting pretrained speech model for Mandarin lyrics transcription and alignment"☆18Dec 14, 2023Updated 2 years ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆56Aug 16, 2025Updated 8 months ago
- Evaluating text-to-image/video/3D models with VQAScore☆382Sep 22, 2025Updated 7 months ago
- [ICML 2024] One Prompt is Not Enough: Automated Construction of a Mixture-of-Expert Prompts - TurningPoint AI☆31Sep 25, 2024Updated last year
- Semi-supervised Junction Tree Variational Autoencoder for jointly trained property prediction and molecule structure generation. (AAAI 23…☆12Jan 14, 2023Updated 3 years ago
- The official evaluation suite and dynamic data release for MixEval.☆11Sep 23, 2024Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆664May 24, 2024Updated last year
- [ECCV2024] "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆62Nov 26, 2024Updated last year
- ☆590Dec 21, 2024Updated last year
- ☆59Mar 9, 2023Updated 3 years ago
- ☆21Nov 19, 2021Updated 4 years ago
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆33Apr 3, 2026Updated last month
- [ECCV 2024] HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance☆51Oct 8, 2024Updated last year
- Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Model☆17Feb 16, 2025Updated last year
- The code of “DreamFuse: Adaptive Image Fusion with Diffusion Transformer”.☆27Jul 25, 2025Updated 9 months ago
- PyTorch implementation for "Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning" (ICML 2024)☆26May 11, 2025Updated 11 months ago
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆236Jul 11, 2024Updated last year