microsoft / RLHF-APA

RL algorithm: Advantage-Induced Policy Alignment (APA)
62, updated last year

Alternatives and similar repositories for RLHF-APA:

Users who are interested in RLHF-APA are comparing it to the libraries listed below.