jannerm / ddpo
Code for the paper "Training Diffusion Models with Reinforcement Learning"
☆547 Updated 2 years ago
Alternatives and similar repositories for ddpo
Users who are interested in ddpo are comparing it to the repositories listed below
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support ☆725 Updated last year
- ☆714 Updated last year
- Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion" ☆1,129 Updated 2 months ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization" ☆641 Updated 2 months ago
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp… ☆310 Updated last year
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model" ☆242 Updated last year
- Implementation of Autoregressive Diffusion in PyTorch ☆431 Updated last month
- The collection of awesome papers on alignment of diffusion models.