KaihuaTang / Building-a-Small-LLM-from-Scratch
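The goal of this series is to let readers understand and implement every component of a large language model from scratch in basic PyTorch, without relying on any other off-the-shelf external libraries, along with the training and fine-tuning code. Readers therefore need only Python, PyTorch, and the most basic deep learning background.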
☆379Aug 28, 2025Updated 5 months ago
Alternatives and similar repositories for Building-a-Small-LLM-from-Scratch
Users who are interested in Building-a-Small-LLM-from-Scratch are comparing it to the libraries listed below
- Learning records for building a large language model from scratch☆58Jan 1, 2025Updated last year
- LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.☆859Dec 10, 2025Updated 2 months ago
- ☆134Feb 17, 2025Updated 11 months ago
- AI Agent Development in Practice☆977Nov 30, 2024Updated last year
- Large Language Model in Action☆342Jan 28, 2025Updated last year
- A toy OS written in pure Rust☆140Oct 28, 2024Updated last year
- Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAI o1, and Dee…☆61Mar 18, 2025Updated 10 months ago
- This project provides a Tensor Parallel (TP) deployment tutorial for Hugging Face LLM models on the 910B, and can also serve as a minimal piece of TP learning code.☆30Jan 6, 2026Updated last month
- Enterprise-grade RAG systems: from beginner to expert☆624Jun 25, 2025Updated 7 months ago
- Chinese translation of the LLMs-from-scratch project☆2,305Oct 15, 2025Updated 3 months ago
- A beginner-friendly guide to learning JAX with practical examples.☆39Aug 26, 2025Updated 5 months ago
- ☆49Feb 5, 2025Updated last year
- Quadra: Effortless and reproducible deep learning workflows with configuration files.☆50Feb 6, 2026Updated last week
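- A simple technical explainer project that focuses on interesting, cutting-edge technical concepts and principles. Each article aims to be readable within 5 minutes.☆6,667Nov 10, 2025Updated 3 months ago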
- Implementation for ECCV 2022 Paper "Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generali…☆20Jul 18, 2022Updated 3 years ago
- ☆66Feb 13, 2025Updated last year
- ☆17Mar 13, 2023Updated 2 years ago
- convert GitHub issues to a website☆28Updated this week
- DDIA: a chapter-by-chapter close reading☆532Nov 27, 2025Updated 2 months ago
- A book for Learning the Foundations of LLMs☆15,742Dec 12, 2025Updated 2 months ago
- A powerful Golang CLI application scaffold integrated with Logrus, arg parser, toml config, testify, Makefile, VSCode and Github Action.☆19Nov 2, 2023Updated 2 years ago
- Fetch arxiv data to LLM-friendly text☆128Jan 31, 2026Updated 2 weeks ago
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- Turn PostgreSQL into your search engine in a Pythonic way.☆51Aug 29, 2025Updated 5 months ago
- Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software☆339Feb 5, 2025Updated last year
- LaTeX lecture materials☆12Apr 7, 2022Updated 3 years ago
- Detect and remove unused dependencies for Python projects☆18Apr 5, 2025Updated 10 months ago
- Code for the article series on building a Python compiler and interpreter☆11Feb 13, 2025Updated last year
- Official codebase for ICML 2025 paper "Core Knowledge Deficits in Multi-Modal Language Models"☆21Oct 1, 2025Updated 4 months ago
- A hand-written mini version of Tomcat that supports access to static and dynamic resources.☆10Dec 27, 2020Updated 5 years ago
- 🚀🚀 [Large Model] Train a 26M-parameter GPT from scratch in just 2h! 🌏☆39,326Feb 6, 2026Updated last week
- Profile your CoreML models directly from Python 🐍☆30Sep 8, 2025Updated 5 months ago
- ☆29Jul 4, 2025Updated 7 months ago
- A streamlined, user-friendly JSON streaming preprocessor, crafted in Python.☆115Sep 20, 2024Updated last year
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆58May 27, 2025Updated 8 months ago
- Static suckless single batch CUDA-only qwen3-0.6B mini inference engine☆545Sep 8, 2025Updated 5 months ago
- [ACM Computing Surveys 2025] This repository collects awesome surveys, resources, and papers for Lifelong Learning with Large Language Model…☆161May 30, 2025Updated 8 months ago
- A lightweight operating system abstraction layer for agents.☆17Dec 26, 2025Updated last month
- The SwiftUI learning project.☆11Nov 6, 2021Updated 4 years ago