A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,748 · Jan 8, 2024 · Updated 2 years ago
Alternatives and similar repositories for trlx
Users who are interested in trlx are comparing it to the libraries listed below.
- A modular RL library to fine-tune language models to human preferences · ☆2,387 · Mar 1, 2024 · Updated 2 years ago
- Train transformer language models with reinforcement learning. · ☆18,411 · Updated this week
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM · ☆7,865 · Oct 11, 2025 · Updated 7 months ago
- Used for adaptive human in the loop evaluation of language and embedding models. · ☆307 · Mar 1, 2023 · Updated 3 years ago
- Code for the paper Fine-Tuning Language Models from Human Preferences · ☆1,391 · Jul 25, 2023 · Updated 2 years ago
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"