aburkov / theLMbookLinks
This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov
☆1,796Updated last month
Alternatives and similar repositories for theLMbook
Users that are interested in theLMbook are comparing it to the libraries listed below
Sorting:
- Simple RL training for reasoning☆3,635Updated 2 months ago
- Official Repo for Open-Reasoner-Zero☆1,969Updated 3 weeks ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆2,656Updated this week
- Democratizing Reinforcement Learning for LLMs☆3,396Updated last month
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,016Updated 3 weeks ago
- Awesome Reasoning LLM Tutorial/Survey/Guide☆1,781Updated last week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,434Updated 5 months ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆9,958Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,438Updated 2 months ago
- ☆477Updated 2 weeks ago
- A reading list on LLM based Synthetic Data Generation 🔥☆1,310Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆3,418Updated last week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,364Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,773Updated this week
- Curated list of datasets and tools for post-training.☆3,175Updated 4 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,548Updated 3 weeks ago
- Training Large Language Model to Reason in a Continuous Latent Space☆1,162Updated 5 months ago
- AllenAI's post-training codebase☆3,018Updated this week
- Scalable RL solution for advanced reasoning of language models☆1,622Updated 3 months ago
- ☆1,025Updated 6 months ago
- ☆1,229Updated 3 months ago
- Witness the aha moment of VLM with less than $3.☆3,785Updated last month
- Textbook on reinforcement learning from human feedback☆1,052Updated this week
- A bibliography and survey of the papers surrounding o1☆1,199Updated 7 months ago
- A very simple GRPO implement for reproducing r1-like LLM thinking.☆1,130Updated 2 months ago
- NanoGPT (124M) in 3 minutes☆2,699Updated this week
- LIMO: Less is More for Reasoning☆963Updated 2 months ago
- O1 Replication Journey☆1,991Updated 5 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆991Updated last month
- An Open Large Reasoning Model for Real-World Solutions☆1,498Updated 3 weeks ago