voidful / TextRLView on GitHub
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
564Apr 23, 2026Updated last week

Alternatives and similar repositories for TextRL

Users that are interested in TextRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?