rasbt / LLMs-from-scratchLinks
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆63,583Updated last week
Alternatives and similar repositories for LLMs-from-scratch
Users that are interested in LLMs-from-scratch are comparing it to the libraries listed below
Sorting:
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆59,592Updated 2 months ago
- llama3 implementation one matrix multiplication at a time☆15,097Updated last year
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆3,423Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,620Updated last week
- LLM101n: Let's build a Storyteller☆34,218Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆43,570Updated 8 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less…☆43,914Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆27,119Updated last month
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,744Updated this week
- Fully open reproduction of DeepSeek-R1☆25,270Updated last week
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆5,177Updated 2 months ago
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆22,451Updated last year
- Awesome-LLM: a curated list of Large Language Model☆24,655Updated 2 weeks ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆13,833Updated 3 weeks ago
- 21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/☆94,840Updated last week
- 《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书,适合希望深入了解 GPT 等大模型架构、训练过程及应用开发的学习者。为了让更多中文读者能够接触到这本极具价值的教材,我决定将其翻译成中文,并…☆2,064Updated last week
- Minimal reproduction of DeepSeek R1-Zero☆12,121Updated 3 months ago
- The official GitHub page for the survey paper "A Survey of Large Language Models".☆11,727Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆55,160Updated this week
- 🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, fee…☆62,550Updated this week
- LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.☆23,019Updated this week
- ☆11,549Updated 7 months ago
- 🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!☆24,252Updated 3 months ago
- Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.☆38,180Updated this week
- DSPy: The framework for programming—not prompting—language models☆27,173Updated this week
- 🔥Highlighting the top ML papers every week.☆11,765Updated 3 weeks ago
- 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming☆57,901Updated last month
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬☆11,378Updated 3 months ago
- 本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)☆20,161Updated 2 weeks ago
- SGLang is a fast serving framework for large language models and vision language models.☆16,953Updated this week