aburkov / theLMbookLinks
This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov
β1,753Updated last week
Alternatives and similar repositories for theLMbook
Users that are interested in theLMbook are comparing it to the libraries listed below
Sorting:
- A reading list on LLM based Synthetic Data Generation π₯β1,280Updated last week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β1,385Updated 4 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β1,876Updated this week
- An Open-source RL System from ByteDance Seed and Tsinghua AIRβ1,261Updated 2 weeks ago
- Democratizing Reinforcement Learning for LLMsβ3,291Updated 2 weeks ago
- AllenAI's post-training codebaseβ2,986Updated this week
- Recipes to scale inference-time compute of open modelsβ1,073Updated last week
- Large Concept Models: Language modeling in a sentence representation spaceβ2,206Updated 4 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ2,356Updated last week
- Scalable RL solution for advanced reasoning of language modelsβ1,587Updated 2 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ1,563Updated last week
- Simple RL training for reasoningβ3,584Updated last month
- LIMO: Less is More for Reasoningβ944Updated last month
- Curated list of datasets and tools for post-training.β3,096Updated 4 months ago
- Training Large Language Model to Reason in a Continuous Latent Spaceβ1,120Updated 4 months ago
- Official Repo for Open-Reasoner-Zeroβ1,930Updated last month
- Fully open data curation for reasoning modelsβ1,793Updated last week
- Awesome Reasoning LLM Tutorial/Survey/Guideβ1,644Updated this week
- β1,020Updated 5 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratchβ1,372Updated last month
- Synthetic data curation for post-training and structured data extractionβ1,352Updated last week
- Large Reasoning Modelsβ804Updated 5 months ago
- Search-o1: Agentic Search-Enhanced Large Reasoning Modelsβ892Updated 2 weeks ago
- β539Updated last month
- System 2 Reasoning Link Collectionβ834Updated 2 months ago
- An Open Large Reasoning Model for Real-World Solutionsβ1,494Updated this week
- β656Updated last month
- Textbook on reinforcement learning from human feedbackβ938Updated this week
- Code for BLT research paperβ1,664Updated last week
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,065Updated 4 months ago