aburkov / theLMbookLinks
This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov
☆2,087Updated last month
Alternatives and similar repositories for theLMbook
Users that are interested in theLMbook are comparing it to the libraries listed below
Sorting:
- Awesome Reasoning LLM Tutorial/Survey/Guide☆2,286Updated 3 months ago
- Textbook on reinforcement learning from human feedback☆1,560Updated this week
- Democratizing Reinforcement Learning for LLMs☆5,081Updated this week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆2,050Updated 2 months ago
- A reading list on LLM based Synthetic Data Generation 🔥☆1,516Updated 8 months ago
- Simple RL training for reasoning☆3,830Updated last month
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,762Updated 9 months ago
- ☆1,345Updated 11 months ago
- Scalable RL solution for advanced reasoning of language models☆1,803Updated 10 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆3,975Updated 2 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,511Updated 2 weeks ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,727Updated 9 months ago
- Curated list of datasets and tools for post-training.☆4,229Updated 3 months ago
- Minimal and annotated implementations of key ideas from modern deep learning research.☆1,228Updated 2 weeks ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,076Updated 5 months ago
- AllenAI's post-training codebase☆3,573Updated this week
- ☆2,583Updated last month
- A Survey of Reinforcement Learning for Large Reasoning Models☆2,316Updated 3 months ago
- Building DeepSeek R1 from Scratch☆744Updated 10 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆2,293Updated 3 weeks ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,625Updated 3 months ago
- Fully open data curation for reasoning models☆2,206Updated 2 months ago
- Official Repo for Open-Reasoner-Zero☆2,087Updated 8 months ago
- DataComp for Language Models☆1,416Updated 5 months ago
- O1 Replication Journey☆2,000Updated last year
- A 4-hour coding workshop to understand how LLMs are implemented and used☆1,068Updated last year
- A very simple GRPO implement for reproducing r1-like LLM thinking.☆1,575Updated 2 months ago
- Everything about the SmolLM and SmolVLM family of models☆3,602Updated 3 weeks ago
- Summarize existing representative LLMs text datasets.☆1,431Updated 4 months ago
- [COLM 2025] LIMO: Less is More for Reasoning☆1,062Updated 6 months ago