aburkov / theLMbook
This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov
☆2,045 Updated last week
Alternatives and similar repositories for theLMbook
Users who are interested in theLMbook are comparing it to the repositories listed below.
- Awesome Reasoning LLM Tutorial/Survey/Guide ☆2,212 Updated 2 months ago
- Textbook on reinforcement learning from human feedback ☆1,354 Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,687 Updated 7 months ago
- ☆2,220 Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning small-sized VLMs. ☆4,380 Updated last month
- ☆1,327 Updated 9 months ago
- Building DeepSeek R1 from Scratch ☆730 Updated 8 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose ☆1,917 Updated 3 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆2,442 Updated 2 weeks ago
- Simple RL training for reasoning ☆3,808 Updated 4 months ago
- Minimal and annotated implementations of key ideas from modern deep learning research. ☆1,205 Updated 2 months ago
- Democratizing Reinforcement Learning for LLMs ☆4,854 Updated this week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR ☆1,667 Updated 7 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… ☆1,997 Updated 2 weeks ago
- DataComp for Language Models ☆1,398 Updated 3 months ago
- Scalable RL solution for advanced reasoning of language models ☆1,783 Updated 8 months ago
- Curated list of datasets and tools for post-training. ☆4,083 Updated last month
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆3,649 Updated last month
- A reading list on LLM based Synthetic Data Generation 🔥 ☆1,488 Updated 6 months ago
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw ☆574 Updated last year
- AllenAI's post-training codebase ☆3,417 Updated this week
- Fully open data curation for reasoning models ☆2,170 Updated 2 weeks ago
- A very simple GRPO implement for reproducing r1-like LLM thinking. ☆1,499 Updated 3 weeks ago
- A straightforward method for training your LLM, from downloading data to generating text. ☆487 Updated 4 months ago
- Implement a reasoning LLM in PyTorch from scratch, step by step ☆2,182 Updated last week
- Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face" ☆714 Updated 2 months ago
- A 4-hour coding workshop to understand how LLMs are implemented and used ☆1,047 Updated 11 months ago
- A Survey of Reinforcement Learning for Large Reasoning Models ☆2,147 Updated last month
- Summarize existing representative LLMs text datasets. ☆1,399 Updated 2 months ago
- Machine Learning Journal for Intermediate to Advanced Topics. ☆2,248 Updated 3 months ago