mihirp1998 / AlignProp

AlignProp uses direct reward backpropagation to align large-scale text-to-image diffusion models. Our method is 25x more sample- and compute-efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion.
298 · Updated 10 months ago

Alternatives and similar repositories for AlignProp

Users who are interested in AlignProp are comparing it to the libraries listed below.
