Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
☆564May 9, 2024Updated last year
Alternatives and similar repositories for TextRL
Users that are interested in TextRL are comparing it to the libraries listed below
Sorting:
- A modular RL library to fine-tune language models to human preferences☆2,378Mar 1, 2024Updated last year
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,741Jan 8, 2024Updated 2 years ago
- A (somewhat) minimal library for finetuning language models with PPO on human feedback.☆90Nov 23, 2022Updated 3 years ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆174Apr 7, 2023Updated 2 years ago
- Train transformer language models with reinforcement learning.☆17,460Updated this week
- 🤖📇 handling multiple nlp task in one pipeline☆57Sep 18, 2025Updated 5 months ago
- Used for adaptive human in the loop evaluation of language and embedding models.☆308Mar 1, 2023Updated 2 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD☆23Dec 13, 2022Updated 3 years ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆475Mar 7, 2024Updated last year
- A publishing website of a table collecting meta-learning-related papers in the area of human language processing.☆17Aug 2, 2021Updated 4 years ago
- ☆98May 30, 2023Updated 2 years ago
- Code for the paper Fine-Tuning Language Models from Human Preferences☆1,377Jul 25, 2023Updated 2 years ago
- ☆31Jul 13, 2023Updated 2 years ago
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"☆20Feb 23, 2021Updated 5 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Jun 29, 2023Updated 2 years ago
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,816Jun 17, 2025Updated 8 months ago
- PFRL: a PyTorch-based deep reinforcement learning library☆1,261Dec 15, 2025Updated 2 months ago
- A curated list of reinforcement learning with human feedback resources (continually updated)☆4,301Dec 9, 2025Updated 2 months ago
- A Unified Library for Parameter-Efficient and Modular Transfer Learning☆2,801Oct 12, 2025Updated 4 months ago
- ☆26Nov 21, 2022Updated 3 years ago
- [NIPS2023] RRHF & Wombat☆809Sep 22, 2023Updated 2 years ago
- 🏃 hosting nlp models in one line☆20May 8, 2024Updated last year
- Diffusion-LM☆1,224Aug 8, 2024Updated last year
- Code accompanying the paper Pretraining Language Models with Human Preferences☆180Feb 13, 2024Updated 2 years ago
- ☆13Sep 25, 2024Updated last year
- Aligning pretrained language models with instruction data generated by themselves.☆4,576Mar 27, 2023Updated 2 years ago
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)☆1,039Sep 19, 2024Updated last year
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- Instruction Tuning with GPT-4☆4,342Jun 11, 2023Updated 2 years ago
- ☆184May 26, 2023Updated 2 years ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,936Mar 14, 2024Updated last year
- Code for "Learning to summarize from human feedback"☆1,059Sep 5, 2023Updated 2 years ago
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆19Nov 29, 2022Updated 3 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020)☆348Dec 20, 2022Updated 3 years ago
- A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)☆1,143Jan 4, 2024Updated 2 years ago
- Evaluation code for various unsupervised automated metrics for Natural Language Generation.☆1,391Aug 20, 2024Updated last year
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"☆1,065Mar 7, 2024Updated last year
- Paper List for Style Transfer in Text☆1,623Mar 16, 2023Updated 2 years ago