thu-ml / SRPO

Code accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
42 · Updated last year

Alternatives and similar repositories for SRPO:

Users who are interested in SRPO are comparing it to the libraries listed below.