andy-yangz / Awesome-RLHF

Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD
22Updated last year

Related projects: