aburkov/theLMbook

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

☆2,169

Alternatives and similar repositories for theLMbook

Users who are interested in theLMbook are comparing it to the repositories listed below.

huggingface / open-r1
Fully open reproduction of DeepSeek-R1
☆26,410 · Apr 2, 2026 · Updated 3 months ago
Jiayi-Pan / TinyZero
Minimal reproduction of DeepSeek R1-Zero
☆13,210 · Feb 27, 2026 · Updated 5 months ago
verl-project / verl
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,699 · Updated this week
hkust-nlp / simpleRL-reason
Simple RL training for reasoning
☆3,871 · Dec 23, 2025 · Updated 7 months ago
lsdefine / simple_GRPO
A very simple GRPO implement for reproducing r1-like LLM thinking.
☆1,698 · Nov 21, 2025 · Updated 8 months ago
naklecha / llama3-from-scratch
llama3 implementation one matrix multiplication at a time
☆15,223 · May 23, 2024 · Updated 2 years ago
OpenRLHF / OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,855 · Jul 14, 2026 · Updated 2 weeks ago
rasbt / LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆99,963 · Updated this week
rllm-org / rllm
Democratizing Reinforcement Learning for LLMs
☆5,740 · Updated this week
huggingface / smol-course
A course on aligning smol models.
☆6,706 · May 26, 2026 · Updated 2 months ago
mlabonne / llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
☆81,286 · Feb 5, 2026 · Updated 5 months ago
unslothai / unsloth
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,965 · Updated this week
huggingface / trl
Train transformer language models with reinforcement learning.
☆18,953 · Updated this week
HandsOnLLM / Hands-On-Large-Language-Models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
☆27,847 · Apr 24, 2026 · Updated 3 months ago
hiyouga / LlamaFactory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
☆73,582 · Updated this week
Lightning-AI / litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆13,594 · Jul 20, 2026 · Updated last week
sail-sg / understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,269 · Aug 27, 2025 · Updated 11 months ago
karpathy / minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
☆10,647 · Jul 1, 2024 · Updated 2 years ago
karpathy / LLM101n
LLM101n: Let's build a Storyteller
☆37,504 · Aug 1, 2024 · Updated last year
PacktPublishing / LLM-Engineers-Handbook
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
☆5,253 · Apr 22, 2026 · Updated 3 months ago
PeterGriffinJin / Search-R1
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,170 · Nov 13, 2025 · Updated 8 months ago
hijkzzz / Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
☆6,893 · Dec 17, 2025 · Updated 7 months ago
huggingface / smollm
Everything about the SmolLM and SmolVLM family of models
☆3,854 · May 26, 2026 · Updated 2 months ago
stanfordnlp / dspy
DSPy: The framework for programming—not prompting—language models
☆36,434 · Updated this week
Unakar / Logic-RL
Reproduce R1 Zero on Logic Puzzle
☆2,450 · Mar 20, 2025 · Updated last year
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆87,317 · Updated this week
huggingface / nanoVLM
The simplest, fastest repository for training/finetuning small-sized VLMs.
☆4,969 · Oct 27, 2025 · Updated 9 months ago
StarsfieldAI / R1-V
Witness the aha moment of VLM with less than $3.
☆4,064 · May 19, 2025 · Updated last year
allenai / open-instruct
AllenAI's post-training codebase
☆3,811 · Updated this week
meta-llama / llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…
☆18,540 · May 19, 2026 · Updated 2 months ago
chiphuyen / aie-book
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
☆16,657 · Jul 3, 2026 · Updated 3 weeks ago
karpathy / nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆61,632 · Nov 12, 2025 · Updated 8 months ago
rasbt / reasoning-from-scratch
Implement a reasoning LLM in PyTorch from scratch, step by step
☆4,829 · Updated this week
mll-lab-nu / RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,757 · Updated this week
modelscope / ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…
☆14,974 · Updated this week
karpathy / llm.c
LLM training in simple, raw C/CUDA
☆30,663 · Jun 26, 2025 · Updated last year
PrimeIntellect-ai / verifiers
Our library for RL environments + evals
☆4,410 · Updated this week
karpathy / nanochat
The best ChatGPT that $100 can buy.
☆56,732 · Jul 4, 2026 · Updated 3 weeks ago
huggingface / smolagents
🤗 smolagents: a barebones library for agents that think in code.
☆28,569 · Jul 21, 2026 · Updated last week