rasbt / LLMs-from-scratchLinks
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆76,350Updated last week
Alternatives and similar repositories for LLMs-from-scratch
Users that are interested in LLMs-from-scratch are comparing it to the libraries listed below
Sorting:
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆66,196Updated 4 months ago
- llama3 implementation one matrix multiplication at a time☆15,180Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,861Updated last week
- LLM training in simple, raw C/CUDA☆27,923Updated 4 months ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆16,850Updated 3 months ago
- Awesome-LLM: a curated list of Large Language Model☆25,366Updated 2 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,064Updated last year
- 21 Lessons, Get Started Building with Generative AI☆100,867Updated this week
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆3,640Updated last year
- LLM101n: Let's build a Storyteller☆35,120Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆47,749Updated 10 months ago
- Machine Learning Engineering Open Book☆15,519Updated last week
- 12 Weeks, 24 Lessons, AI for All!☆43,368Updated this week
- In-depth tutorials on LLMs, RAGs and real-world AI agent applications.☆19,051Updated this week
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API☆13,335Updated last year
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.☆47,355Updated this week
- Simple, unified interface to multiple Generative AI providers☆12,591Updated 2 weeks ago
- DSPy: The framework for programming—not prompting—language models☆29,488Updated this week
- The multi-agent framework, runtime and UI built for speed.☆34,512Updated last week
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬☆11,620Updated 6 months ago
- This repository contains demos I made with the Transformers library by HuggingFace.☆11,290Updated 3 months ago
- 《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书,适合希望深入了解 GPT 等大模型架构、训练过程及应用开发的学习者。为了让更多中文读者能够接触到这本极具价值的教材,我决定将其翻译成中文,并…☆2,459Updated last month
- Video+code lecture on building nanoGPT from scratch☆4,469Updated last year
- Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.☆41,695Updated this week
- Official inference framework for 1-bit LLMs☆24,301Updated 4 months ago
- A one stop repository for generative AI research updates, interview resources, notebooks and much more!☆19,825Updated 3 weeks ago
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information r…☆22,525Updated 3 weeks ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆60,848Updated this week
- Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including …☆27,160Updated last year
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆28,781Updated this week