microsoft / RLHF-APA
RL algorithm: Advantage-Induced Policy Alignment (APA)
66 | Aug 11, 2023 | Updated 2 years ago

Alternatives and similar repositories for RLHF-APA

Users who are interested in RLHF-APA compare it to the libraries listed below.
