SRDdev / PaLM-RLHF

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM.
30 · Updated 2 weeks ago

Alternatives and similar repositories for PaLM-RLHF:

Users who are interested in PaLM-RLHF are comparing it to the libraries listed below.