rasbt / LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆74,452 Updated last week
Alternatives and similar repositories for LLMs-from-scratch
Users who are interested in LLMs-from-scratch are comparing it to the libraries listed below
- llama3 implementation one matrix multiplication at a time ☆15,162 Updated last year
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM. ☆46,548 Updated this week
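- Build a large language model from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step from scratch and gain a deep understanding of how large models work ☆3,589 Updated last year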
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆28,517 Updated this week
- MLX: An array framework for Apple silicon ☆22,395 Updated this week
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. ☆64,506 Updated 4 months ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models" ☆16,146 Updated 2 months ago
- Awesome-LLM: a curated list of Large Language Model ☆25,197 Updated 2 months ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆27,475 Updated last week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ☆17,925 Updated this week
- In-depth tutorials on LLMs, RAGs and real-world AI agent applications. ☆18,406 Updated last week
- We write your reusable computer vision tools. 💜 ☆35,447 Updated last week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆111,447 Updated this week
- Machine Learning Engineering Open Book ☆15,386 Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆12,817 Updated last week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆70,812 Updated this week
- An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large… ☆17,866 Updated last month
- Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management. ☆40,796 Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆44,823 Updated 9 months ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ☆44,575 Updated this week
- LLM training in simple, raw C/CUDA ☆27,769 Updated 3 months ago
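- "Build a Large Language Model (From Scratch)" is an e-book that explores the principles and implementation of large language models in depth, suited to learners who want a thorough understanding of the architecture, training process, and application development of large models such as GPT. To make this highly valuable textbook accessible to more Chinese readers, I decided to translate it into Chinese, and… ☆2,354 Updated last month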
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆9,975 Updated last year
- DSPy: The framework for programming—not prompting—language models ☆28,825 Updated this week
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models. ☆153,598 Updated this week
- 12 Lessons to Get Started Building AI Agents ☆41,995 Updated this week
- 12 Weeks, 24 Lessons, AI for All! ☆42,947 Updated this week
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization ☆5,588 Updated last week
- Python scraper based on AI ☆21,411 Updated this week
- LLM inference in C/C++ ☆87,149 Updated this week