Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in Hugging Face's transformers (bloomz-176B/bloom/gpt/bart/T5/MetaICL)
☆564 · Apr 23, 2026 · Updated last week
Alternatives and similar repositories for TextRL
Users that are interested in TextRL are comparing it to the libraries listed below.
- A modular RL library to fine-tune language models to human preferences ☆2,388 · Mar 1, 2024 · Updated 2 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆4,745 · Jan 8, 2024 · Updated 2 years ago
- A (somewhat) minimal library for finetuning language models with PPO on human feedback. ☆91 · Nov 23, 2022 · Updated 3 years ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF) ☆174 · Apr 7, 2023 · Updated 3 years ago
- ASR text preprocessing utility ☆21 · Aug 5, 2024 · Updated last year
- Train transformer language models with reinforcement learning. ☆18,193 · Updated this week
- A publishing website of a table collecting meta-learning-related papers in the area of human language processing. ☆17 · Aug 2, 2021 · Updated 4 years ago
- Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD ☆23 · Dec 13, 2022 · Updated 3 years ago
- ☆98 · May 30, 2023 · Updated 2 years ago
- 🤖📇 handling multiple NLP tasks in one pipeline ☆57 · Sep 18, 2025 · Updated 7 months ago
- Used for adaptive human in the loop evaluation of language and embedding models. ☆307 · Mar 1, 2023 · Updated 3 years ago
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM ☆7,870 · Oct 11, 2025 · Updated 6 months ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation ☆476 · Mar 7, 2024 · Updated 2 years ago
- Code for the paper Fine-Tuning Language Models from Human Preferences ☆1,387 · Jul 25, 2023 · Updated 2 years ago
- PFRL: a PyTorch-based deep reinforcement learning library ☆1,269 · Mar 2, 2026 · Updated last month
- ☆31 · Jul 13, 2023 · Updated 2 years ago
- ☆13 · Sep 25, 2024 · Updated last year
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5 ☆19 · Nov 29, 2022 · Updated 3 years ago
- 🏃 hosting NLP models in one line ☆20 · May 8, 2024 · Updated last year
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback" ☆1,840 · Jun 17, 2025 · Updated 10 months ago
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation" ☆21 · Feb 23, 2021 · Updated 5 years ago
- one script for xls-r/xlsr/whisper fine-tuning ☆42 · Jun 29, 2023 · Updated 2 years ago
- A Unified Library for Parameter-Efficient and Modular Transfer Learning ☆2,811 · Mar 21, 2026 · Updated last month
- Explore different ways to mix speech models (wav2vec2, HuBERT) and NLP models (BART, T5, GPT) together ☆46 · Jul 3, 2025 · Updated 9 months ago
- Prompt-and-Rerank: A Method for Zero-Shot and Few-Shot Textual Style Transfer ☆36 · Oct 2, 2022 · Updated 3 years ago
- Code for "Learning to summarize from human feedback" ☆1,063 · Sep 5, 2023 · Updated 2 years ago
- Diffusion-LM ☆1,237 · Aug 8, 2024 · Updated last year
- ☆26 · Nov 21, 2022 · Updated 3 years ago
- A curated list of reinforcement learning with human feedback resources (continually updated) ☆4,352 · Dec 9, 2025 · Updated 4 months ago
- ☆35 · Nov 17, 2021 · Updated 4 years ago
- [NIPS2023] RRHF & Wombat ☆808 · Sep 22, 2023 · Updated 2 years ago
- Code accompanying the paper Pretraining Language Models with Human Preferences ☆181 · Feb 13, 2024 · Updated 2 years ago
- 🍳 NLPrep - dataset tool for many natural language processing tasks ☆28 · Jul 30, 2021 · Updated 4 years ago
- MediaTek Research is dedicated to research on foundation models. We put that research into models suited to Traditional Chinese users and, where licensing permits, make the models available for academic research or industry use. ☆270 · Apr 23, 2026 · Updated last week
- Instruction Tuning with GPT-4 ☆4,337 · Jun 11, 2023 · Updated 2 years ago
- ☆34 · Mar 25, 2023 · Updated 3 years ago
- Convenient Text-to-Text Training for Transformers ☆18 · Dec 10, 2021 · Updated 4 years ago
- Aligning pretrained language models with instruction data generated by themselves. ☆4,591 · Mar 27, 2023 · Updated 3 years ago
- Revolutionize your development workflow with AI-powered code assistance, automating mock tests, suggestions, and unit test generation in … ☆33 · Feb 27, 2025 · Updated last year