aburkov / theLMbook
This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov
☆2,069 Updated 3 weeks ago
Alternatives and similar repositories for theLMbook
Users who are interested in theLMbook are comparing it to the repositories listed below.
- Awesome Reasoning LLM Tutorial/Survey/Guide ☆2,235 Updated 2 months ago
- Textbook on reinforcement learning from human feedback ☆1,382 Updated last week
- ☆1,335 Updated 10 months ago
- Simple RL training for reasoning ☆3,819 Updated 2 weeks ago
- Democratizing Reinforcement Learning for LLMs ☆4,942 Updated last week
- Fully open data curation for reasoning models ☆2,182 Updated last month
- A reading list on LLM based Synthetic Data Generation 🔥 ☆1,496 Updated 7 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,729 Updated 8 months ago
- Code for BLT research paper ☆2,024 Updated 2 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆2,467 Updated this week
- Minimalistic 4D-parallelism distributed training framework for educational purposes ☆1,939 Updated 4 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… ☆2,029 Updated last month
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆3,763 Updated last month
- Building DeepSeek R1 from Scratch ☆735 Updated 9 months ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR ☆1,698 Updated 7 months ago
- ☆1,377 Updated 3 months ago
- Implement a reasoning LLM in PyTorch from scratch, step by step ☆2,372 Updated this week
- Scalable RL solution for advanced reasoning of language models ☆1,790 Updated 9 months ago
- Minimal and annotated implementations of key ideas from modern deep learning research. ☆1,217 Updated 3 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs. ☆4,494 Updated 2 months ago
- [COLM 2025] LIMO: Less is More for Reasoning ☆1,059 Updated 5 months ago
- Official Repo for Open-Reasoner-Zero ☆2,085 Updated 7 months ago
- DataComp for Language Models ☆1,404 Updated 4 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,015 Updated 2 weeks ago
- AllenAI's post-training codebase ☆3,515 Updated this week
- Witness the aha moment of VLM with less than $3. ☆4,020 Updated 7 months ago
- A Survey of Reinforcement Learning for Large Reasoning Models ☆2,237 Updated 2 months ago
- A straightforward method for training your LLM, from downloading data to generating text. ☆503 Updated 5 months ago
- ☆2,344 Updated last month
- A summary of existing representative LLM text datasets. ☆1,416 Updated 2 months ago