anshradh / trl_custom

Applies Reinforcement Learning from Human Feedback (RLHF) to language models, teaching them to write short stories in response to writing prompts.
14 · Updated 2 years ago

Related projects: