lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM.
7,705 · Updated 10 months ago

Related projects

Alternatives and complementary repositories for PaLM-rlhf-pytorch