Textbook on reinforcement learning from human feedback
☆1,819 · Apr 15, 2026 · Updated this week
Alternatives and similar repositories for rlhf-book
Users that are interested in rlhf-book are comparing it to the libraries listed below.
- Democratizing Reinforcement Learning for LLMs · ☆5,439 · Updated this week
- AllenAI's post-training codebase · ☆3,683 · Updated this week
- Train transformer language models with reinforcement learning. · ☆18,054 · Updated this week
- Agentic RL Training at Scale · ☆1,292 · Updated this week
- Minimalistic 4D-parallelism distributed training framework for education purpose · ☆2,146 · Aug 26, 2025 · Updated 7 months ago
- Our library for RL environments + evals · ☆4,016 · Updated this week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… · ☆2,090 · Dec 3, 2025 · Updated 4 months ago
- verl: Volcano Engine Reinforcement Learning for LLMs · ☆20,603 · Apr 10, 2026 · Updated last week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards · ☆1,397 · Mar 28, 2026 · Updated 3 weeks ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy… · ☆9,340 · Updated this week
- Awesome Reasoning LLM Tutorial/Survey/Guide · ☆2,364 · Apr 6, 2026 · Updated last week
- Everything about the SmolLM and SmolVLM family of models · ☆3,705 · Apr 2, 2026 · Updated 2 weeks ago
- NanoGPT (124M) in 2 minutes · ☆5,095 · Updated this week
- Minimal reproduction of DeepSeek R1-Zero