rasbt / LLMs-from-scratchLinks
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆71,422Updated this week
Alternatives and similar repositories for LLMs-from-scratch
Users that are interested in LLMs-from-scratch are comparing it to the libraries listed below
Sorting:
- llama3 implementation one matrix multiplication at a time☆15,148Updated last year
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆15,618Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆58,035Updated this week
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆3,518Updated last year
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆62,266Updated 3 months ago
- Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.☆69,076Updated this week
- Awesome-LLM: a curated list of Large Language Model☆25,016Updated last month
- In-depth tutorials on LLMs, RAGs and real-world AI agent applications.☆18,064Updated this week
- LLM training in simple, raw C/CUDA☆27,588Updated 2 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,748Updated this week
- 《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书,适合希望深入了解 GPT 等大模型架构、训练过程及应用开 发的学习者。为了让更多中文读者能够接触到这本极具价值的教材,我决定将其翻译成中文,并…☆2,242Updated last week
- Machine Learning Engineering Open Book☆15,076Updated this week
- The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data 🔥☆56,708Updated this week
- LLM101n: Let's build a Storyteller☆34,375Updated last year
- Inference code for Llama models☆58,737Updated 7 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less…☆45,445Updated this week
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆5,484Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆58,439Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,856Updated last week
- Open-Sora: Democratizing Efficient Video Production for All☆27,175Updated 4 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆17,823Updated this week
- DSPy: The framework for programming—not prompting—language models☆28,156Updated this week
- The official GitHub page for the survey paper "A Survey of Large Language Models".☆11,807Updated 6 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,741Updated last year
- A collection of MCP servers.☆69,974Updated last week
- Generative Agents: Interactive Simulacra of Human Behavior☆19,600Updated last year
- [WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)☆8,963Updated 7 months ago
- A generative world for general-purpose robotics & embodied AI learning.☆27,236Updated this week
- Latest Advances on Multimodal Large Language Models☆16,241Updated 2 weeks ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,096Updated 3 months ago