aburkov / theLMbook
This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov
☆1,350Updated 3 weeks ago
Alternatives and similar repositories for theLMbook:
Users that are interested in theLMbook are comparing it to the libraries listed below
- Simple RL training for reasoning☆3,326Updated this week
- Awesome Reasoning LLM Tutorial/Survey/Guide☆1,220Updated this week
- A reading list on LLM based Synthetic Data Generation 🔥☆1,221Updated last month
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆1,265Updated this week
- Democratizing Reinforcement Learning for LLMs☆2,158Updated last month
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆1,466Updated this week
- NanoGPT (124M) in 3 minutes☆2,427Updated last week
- Recipes to scale inference-time compute of open models☆1,048Updated last month
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,736Updated 2 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,094Updated 2 months ago
- Everything about the SmolLM2 and SmolVLM family of models☆2,049Updated last week
- Fully open data curation for reasoning models☆1,591Updated 2 weeks ago
- O1 Replication Journey☆1,980Updated 2 months ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆5,693Updated this week
- Official Repo for Open-Reasoner-Zero☆1,687Updated 3 weeks ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆915Updated this week
- Training Large Language Model to Reason in a Continuous Latent Space☆1,015Updated 2 months ago
- A library for advanced large language model reasoning☆2,065Updated last month
- ☆1,011Updated 3 months ago
- LIMO: Less is More for Reasoning☆875Updated last month
- Scalable RL solution for advanced reasoning of language models☆1,445Updated last week
- Minimalistic 4D-parallelism distributed training framework for education purpose☆962Updated 3 weeks ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆459Updated this week
- Large Concept Models: Language modeling in a sentence representation space☆2,069Updated 2 months ago
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆748Updated 3 weeks ago
- System 2 Reasoning Link Collection☆812Updated 2 weeks ago
- MoBA: Mixture of Block Attention for Long-Context LLMs☆1,696Updated 3 weeks ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,336Updated this week
- Large Reasoning Models☆800Updated 3 months ago
- OLMoE: Open Mixture-of-Experts Language Models☆693Updated 2 weeks ago