voidful / TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
549Updated 8 months ago

Alternatives and similar repositories for TextRL:

Users that are interested in TextRL are comparing it to the libraries listed below