aburkov / theLMbookLinks
This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov
☆1,936Updated 4 months ago
Alternatives and similar repositories for theLMbook
Users that are interested in theLMbook are comparing it to the libraries listed below
Sorting:
- Awesome Reasoning LLM Tutorial/Survey/Guide☆2,089Updated 3 months ago
- Textbook on reinforcement learning from human feedback☆1,259Updated last week
- ☆1,292Updated 7 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,662Updated this week
- Democratizing Reinforcement Learning for LLMs☆4,414Updated last week
- A reading list on LLM based Synthetic Data Generation 🔥☆1,427Updated 4 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,100Updated last month
- ☆1,467Updated this week
- Simple RL training for reasoning☆3,753Updated 2 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,846Updated last month
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,596Updated 5 months ago
- Fully open data curation for reasoning models☆2,109Updated last month
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆3,267Updated last week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,570Updated 5 months ago
- A 4-hour coding workshop to understand how LLMs are implemented and used☆1,035Updated 9 months ago
- Curated list of datasets and tools for post-training.☆3,777Updated 2 months ago
- NanoGPT (124M) in 3 minutes☆3,176Updated 2 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,330Updated last week
- Building DeepSeek R1 from Scratch☆704Updated 6 months ago
- It is said that, Ilya Sutskever gave John Carmack this reading list of ~ 30 research papers on deep learning.☆849Updated last year
- Summarize existing representative LLMs text datasets.☆1,360Updated 6 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,987Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,895Updated this week
- DataComp for Language Models☆1,371Updated last month
- nanoGPT style version of Llama 3.1☆1,429Updated last year
- Implement a reasoning LLM in PyTorch from scratch, step by step☆1,621Updated this week
- Machine Learning Journal for Intermediate to Advanced Topics.☆2,206Updated last month
- Witness the aha moment of VLM with less than $3.☆3,950Updated 4 months ago
- A course on aligning smol models.☆6,440Updated last week
- Large Concept Models: Language modeling in a sentence representation space☆2,290Updated 8 months ago