rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

☆59,443

Alternatives and similar repositories for LLMs-from-scratch

Users who are interested in LLMs-from-scratch are comparing it to the repositories listed below.

mlabonne / llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
☆57,824 Updated last month
naklecha / llama3-from-scratch
llama3 implementation one matrix multiplication at a time
☆15,050 Updated last year
karpathy / LLM101n
LLM101n: Let's build a Storyteller
☆34,055 Updated 11 months ago
datawhalechina / llms-from-scratch-cn
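Build a large language model from scratch with only basic Python knowledge; build GLM4, Llama3, and RWKV6 step by step from scratch to gain a deep understanding of how large models work.
☆3,342 Updated 11 months ago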
stanford-oval / storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
☆26,959 Updated 3 weeks ago
RUCAIBox / LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
☆11,694 Updated 4 months ago
Lightning-AI / litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆12,547 Updated this week
Hannibal046 / Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
☆24,321 Updated this week
microsoft / graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
☆26,742 Updated this week
karpathy / minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
☆9,772 Updated last year
hiyouga / LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
☆54,711 Updated this week
karpathy / llm.c
LLM training in simple, raw C/CUDA
☆27,216 Updated 3 weeks ago
Mooler0410 / LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
☆9,991 Updated last year
stas00 / ml-engineering
Machine Learning Engineering Open Book
☆14,454 Updated last week
HandsOnLLM / Hands-On-Large-Language-Models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
☆13,119 Updated this week
karpathy / nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆43,152 Updated 7 months ago
meta-llama / llama-cookbook
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, RAG. We als…
☆17,654 Updated this week
unslothai / unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
☆42,597 Updated this week
NielsRogge / Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
☆11,102 Updated 3 weeks ago
aishwaryanr / awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
☆13,438 Updated last month
microsoft / autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut…
☆47,688 Updated this week
jingyaogong / minimind
🚀🚀 Train a 26M-parameter GPT from scratch in just 2h!
☆23,419 Updated 2 months ago
skindhu / Build-A-Large-Language-Model-CN
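"Build a Large Language Model (From Scratch)" is an e-book that explores the principles and implementation of large language models in depth, suited to learners who want a thorough understanding of the architecture, training process, and application development of large models such as GPT. To make this highly valuable textbook accessible to more Chinese readers, I decided to translate it into Chinese, and…
☆1,933 Updated this week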
google-research / tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
☆28,953 Updated last year
abhishekkrthakur / approachingalmost
Approaching (Almost) Any Machine Learning Problem
☆8,013 Updated 2 years ago
liguodongiot / llm-action
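This project aims to share the technical principles behind large models and hands-on experience (large-model engineering and deploying large-model applications).
☆19,631 Updated 2 weeks ago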
udlbook / udlbook
Understanding Deep Learning - Simon J.D. Prince
☆7,665 Updated 2 weeks ago
microsoft / generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
☆92,030 Updated last week
Lordog / dive-into-llms
"Dive into LLMs" (动手学大模型): a series of hands-on programming tutorials.
☆7,085 Updated last week
stanfordnlp / dspy
DSPy: The framework for programming—not prompting—language models
☆26,664 Updated this week