SalesforceAIResearch / DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
☆480Updated 3 months ago
Alternatives and similar repositories for DiffusionDPO
Users that are interested in DiffusionDPO are comparing it to the libraries listed below
Sorting:
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆282Updated 6 months ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆501Updated 11 months ago
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"☆225Updated last year
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation☆255Updated last month
- GenEval: An object-focused framework for evaluating text-to-image alignment☆255Updated 2 months ago
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need☆218Updated 2 months ago
- ☆503Updated 4 months ago
- An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL☆170Updated this week
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆496Updated this week
- This repo contains the code for 1D tokenizer and generator☆857Updated last month
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆235Updated last month
- The collection of awesome papers on alignment of diffusion models.☆211Updated this week
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆506Updated 11 months ago
- Scaling Diffusion Transformers with Mixture of Experts☆321Updated 8 months ago
- Code for Fast Training of Diffusion Models with Masked Transformers☆402Updated 11 months ago
- MoVQGAN - model for the image encoding and reconstruction☆233Updated last year
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆275Updated 5 months ago
- [ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"☆321Updated last year
- This is a repo to track the latest autoregressive visual generation papers.☆307Updated last week
- Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024☆340Updated 7 months ago
- An open-source toolbox for fast sampling of diffusion models. Official implementations of our works published in ICML, NeurIPS, CVPR.☆282Updated 2 months ago
- Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.☆335Updated 10 months ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".☆292Updated 6 months ago
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆351Updated 4 months ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆875Updated 2 months ago
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,037Updated last month
- Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers☆194Updated 3 weeks ago
- [ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy☆247Updated 5 months ago
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆835Updated last year
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization☆206Updated last month