tmabraham / ddpo-pytorch

Reproduction of the DDPO paper (RLHF for diffusion models)
70 · Updated last year

Related projects: