rasbt / LLMs-from-scratchLinks
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆68,653Updated this week
Alternatives and similar repositories for LLMs-from-scratch
Users that are interested in LLMs-from-scratch are comparing it to the libraries listed below
Sorting:
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆61,471Updated 3 months ago
- llama3 implementation one matrix multiplication at a time☆15,123Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,691Updated last week
- Machine Learning Engineering Open Book☆14,957Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less…☆44,900Updated last week
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆3,490Updated last year
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆15,039Updated last month
- The official Meta Llama 3 GitHub site☆28,955Updated 7 months ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆27,243Updated 2 months ago
- 《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书,适合希望深入了解 GPT 等大模型架构、训练过程及应用开发的学习者。为了让更多中文读者能够接触到这本极具价值的教材,我决定将其翻译成中文,并…☆2,205Updated last week
- LLM101n: Let's build a Storyteller☆34,328Updated last year
- Awesome-LLM: a curated list of Large Language Model☆24,891Updated last month
- Python scraper based on AI☆21,166Updated 3 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆43,960Updated 8 months ago
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆5,371Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆27,863Updated this week
- 21 Lessons, Get Started Building with Generative AI☆96,485Updated this week
- A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.☆19,722Updated 2 weeks ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,797Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆56,914Updated this week
- LLM training in simple, raw C/CUDA☆27,536Updated 2 months ago
- Anthropic's educational courses☆17,021Updated 9 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,908Updated last year
- 🙌 OpenHands: Code Less, Make More☆62,981Updated this week
- A one stop repository for generative AI research updates, interview resources, notebooks and much more!☆16,343Updated last week
- Building a quick conversation-based search demo with Lepton AI.☆8,130Updated last week
- LLMs-from-scratch项目中文翻译☆1,520Updated 4 months ago
- Code Repository for Machine Learning with PyTorch and Scikit-Learn☆4,581Updated 5 months ago
- Fast and memory-efficient exact attention☆19,275Updated last week
- Latest Advances on Multimodal Large Language Models☆16,182Updated this week