KaihuaTang / Building-a-Small-LLM-from-ScratchView external linksLinks
该系列的目的是让读者可以在基础的pytorch上,不依赖任何其他现成的外部库,从零开始理解并实现一个大语言模型的所有组成部分,以及 训练微调代码,因此读者仅需python,pytorch和最基础深度学习背景知识即可。
☆379Aug 28, 2025Updated 5 months ago
Alternatives and similar repositories for Building-a-Small-LLM-from-Scratch
Users that are interested in Building-a-Small-LLM-from-Scratch are comparing it to the libraries listed below
Sorting:
- LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.☆859Dec 10, 2025Updated 2 months ago
- ☆134Feb 17, 2025Updated 11 months ago
- AI Agent 开发实战☆977Nov 30, 2024Updated last year
- Large Language Model in Action☆342Jan 28, 2025Updated last year
- LLMs-from-scratch项目中文翻译☆2,305Oct 15, 2025Updated 3 months ago
- DeepSeek 系列工作解读、扩展和复现。☆700Mar 29, 2025Updated 10 months ago
- ☆49Feb 5, 2025Updated last year
- 从零开始学大模型Transformer、GPT2、BERT pre-training and fine-tuning from scratch☆37Jul 1, 2024Updated last year
- ☆66Feb 13, 2025Updated last year
- ☆54Nov 14, 2024Updated last year
- A tiny, didactical implementation of LLAMA 3☆42Dec 2, 2024Updated last year
- DDIA 逐章精读☆532Nov 27, 2025Updated 2 months ago
- A book for Learning the Foundations of LLMs☆15,742Dec 12, 2025Updated 2 months ago
- A powerful Golang CLI application scaffold integrated with Logrus, arg parser, toml config, testify, Makefile, VSCode and Github Action.☆19Nov 2, 2023Updated 2 years ago
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software☆339Feb 5, 2025Updated last year
- 手写一个迷你版本的Tomcat,实现了静态、动态资源的访问。☆10Dec 27, 2020Updated 5 years ago
- Detect and remove unused dependencies for Python projects☆18Apr 5, 2025Updated 10 months ago
- 🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!☆39,326Feb 6, 2026Updated last week
- 推荐系统入门指南,全面介绍了工业级推荐系统的理论知识(王树森推荐系统公开课-基于小红书的场景讲解工业界真实的推荐系统),如何基于TensorFlow2训练模型,如何实现高性能、高并发、高可用的Golang推理微服务。Comprehensively introduced th…☆688Feb 10, 2025Updated last year
- Static suckless single batch CUDA-only qwen3-0.6B mini inference engine☆545Sep 8, 2025Updated 5 months ago
- [ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Model…☆161May 30, 2025Updated 8 months ago
- A lightweight operating system abstraction layer for agents.☆17Dec 26, 2025Updated last month
- 大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"☆1,610Feb 4, 2026Updated last week
- A command line tool that displays information about the current system, including hardware and critical software.☆29Dec 24, 2025Updated last month
- 系统设计面试:内幕指南(System Design Interview: An Insider’s Guide)☆2,609Feb 6, 2026Updated last week
- 利用LLM构建应用实践笔记☆768Nov 29, 2024Updated last year
- 顾名思义:手搓的RAG☆132Feb 27, 2024Updated last year
- 《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣☆4,250Sep 2, 2025Updated 5 months ago
- network pinger with UI☆14Feb 12, 2024Updated 2 years ago
- 将Alist一键注册为WIndows服务☆17Mar 10, 2025Updated 11 months ago
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆512May 10, 2024Updated last year
- 《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。☆4,405Jan 27, 2025Updated last year
- 🧑🚀 全世界最好的LLM资料总结(多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.☆7,501Feb 6, 2026Updated last week
- 遇事不决,Vibe 力学! One-Person Company AI Tools Series – continuously updated to help boost productivity and empower your solo business!☆2,628May 8, 2025Updated 9 months ago
- RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology.☆2,769Dec 8, 2025Updated 2 months ago
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆121Oct 16, 2025Updated 3 months ago
- A Simple Operating System Written by Me for i386-cpu, Created From Scratch☆13Sep 24, 2018Updated 7 years ago
- implement a simple jvm with java☆103Mar 7, 2024Updated last year