voidful / TextRLLinks
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
☆566Updated last year
Alternatives and similar repositories for TextRL
Users that are interested in TextRL are comparing it to the libraries listed below
Sorting:
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆173Updated 2 years ago
- Expanding natural instructions☆1,023Updated last year
- ☆98Updated 2 years ago
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆462Updated 3 years ago
- Original Implementation of Prompt Tuning from Lester, et al, 2021☆696Updated 8 months ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆473Updated last year
- A research project for natural language generation, containing the official implementations by MSRA NLC team.☆739Updated last year
- Code for "Learning to summarize from human feedback"☆1,052Updated 2 years ago
- Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.☆473Updated last year
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆832Updated last year
- Code for the paper Fine-Tuning Language Models from Human Preferences☆1,374Updated 2 years ago
- A modular RL library to fine-tune language models to human preferences☆2,366Updated last year
- Crosslingual Generalization through Multitask Finetuning