mihirp1998 / AlignPropLinks

AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion
302Updated last year

Alternatives and similar repositories for AlignProp

Users that are interested in AlignProp are comparing it to the libraries listed below

Sorting: