rasbt / LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆84,736 Updated last week
Alternatives and similar repositories for LLMs-from-scratch
Users interested in LLMs-from-scratch are comparing it to the repositories listed below.
- llama3 implementation one matrix multiplication at a time ☆15,241 Updated last year
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. ☆74,415 Updated last month
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models" ☆20,239 Updated last month
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆52,642 Updated 2 months ago
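- Build large language models from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step from scratch to gain a deep understanding of how large models work ☆3,939 Updated last year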
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training ☆23,451 Updated last year
- Simple, unified interface to multiple Generative AI providers ☆13,425 Updated last month
- LLM training in simple, raw C/CUDA ☆28,763 Updated 7 months ago
- Understanding Deep Learning - Simon J.D. Prince ☆9,051 Updated 2 weeks ago
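- "Build a Large Language Model (From Scratch)" is an e-book that explores the principles and implementation of large language models in depth, suited to learners who want a deeper understanding of the architecture, training process, and application development of GPT and similar large models. To make this highly valuable textbook accessible to more Chinese readers, I decided to translate it into Chinese, and… ☆3,169 Updated 5 months ago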
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. ☆51,625 Updated this week
- LLM101n: Let's build a Storyteller ☆36,254 Updated last year
- Awesome-LLM: a curated list of Large Language Model ☆26,195 Updated 6 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆10,293 Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆69,622 Updated this week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat… ☆72,576 Updated last week
- Machine Learning Engineering Open Book ☆16,586 Updated 2 weeks ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. ☆102,600 Updated this week
- Universal memory layer for AI Agents ☆46,647 Updated this week
- Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course. ☆17,068 Updated last year
- Build resilient language agents as graphs. ☆24,291 Updated this week
- Explanations of key concepts in ML ☆8,265 Updated 7 months ago
- Production-ready platform for agentic workflow development. ☆128,415 Updated last week
- Train transformer language models with reinforcement learning. ☆17,297 Updated this week
- Code Repository for Machine Learning with PyTorch and Scikit-Learn ☆4,969 Updated last month
- Chinese translation of the LLMs-from-scratch project ☆2,305 Updated 3 months ago
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization ☆6,621 Updated 3 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆13,137 Updated this week
- DSPy: The framework for programming—not prompting—language models ☆32,010 Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more. ☆54,105 Updated this week