rasbt / LLMs-from-scratchLinks
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆82,261Updated this week
Alternatives and similar repositories for LLMs-from-scratch
Users that are interested in LLMs-from-scratch are comparing it to the libraries listed below
Sorting:
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆72,395Updated 2 weeks ago
- llama3 implementation one matrix multiplication at a time☆15,219Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,071Updated this week
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆3,862Updated last year
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆19,437Updated 3 weeks ago
- DSPy: The framework for programming—not prompting—language models☆31,170Updated last week
- LLM101n: Let's build a Storyteller☆36,090Updated last year
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆50,236Updated this week
- Machine Learning Engineering Open Book☆16,141Updated 2 weeks ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,242Updated last year
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆27,763Updated 3 months ago
- Awesome-LLM: a curated list of Large Language Model☆25,938Updated 5 months ago
- 《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书,适合希望深入了解 GPT 等大模型架构、训练过程及应用开发的学习者。为了让更多中文读者能够接触到这本极具价值的教材,我决定将其翻译成中文,并…☆2,933Updated 4 months ago
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆6,336Updated 3 weeks ago
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆23,215Updated last year
- ☆5,679Updated 11 months ago
- Explanation to key concepts in ML☆8,252Updated 6 months ago
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information r…☆23,863Updated 3 weeks ago
- LLM training in simple, raw C/CUDA☆28,510Updated 6 months ago
- We write your reusable computer vision tools. 💜☆36,243Updated 2 weeks ago
- 🔥Highlighting the top ML papers every week.☆12,180Updated 5 months ago
- Neural Networks: Zero to Hero☆19,497Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆51,606Updated last month
- Minimal reproduction of DeepSeek R1-Zero☆12,571Updated 8 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆66,734Updated this week
- Inference Llama 2 in one file of pure C☆19,063Updated last year
- LLMs-from-scratch项目中文翻译☆2,179Updated 2 months ago
- A list of AI autonomous agents☆24,909Updated 10 months ago
- Video+code lecture on building nanoGPT from scratch☆4,648Updated last year
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.☆158,762Updated this week