rasbt / LLMs-from-scratchLinks
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆78,286Updated this week
Alternatives and similar repositories for LLMs-from-scratch
Users that are interested in LLMs-from-scratch are comparing it to the libraries listed below
Sorting:
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆67,241Updated 5 months ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆17,699Updated 3 months ago
- llama3 implementation one matrix multiplication at a time☆15,195Updated last year
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆3,709Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,918Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆49,134Updated 11 months ago
- Understanding Deep Learning - Simon J.D. Prince☆8,484Updated last week
- 《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书,适合希望深入了解 GPT 等大模型架构、训练过程及应用开发的学习者。为了让更多中文读者能够接触到这本极具价值的教材,我决定将其翻译成中文,并…☆2,604Updated 2 months ago
- A one stop repository for generative AI research updates, interview resources, notebooks and much more!☆20,326Updated 2 weeks ago
- A list of AI autonomous agents☆24,037Updated 8 months ago
- LLM training in simple, raw C/CUDA☆28,139Updated 4 months ago
- 21 Lessons, Get Started Building with Generative AI☆101,499Updated last week
- 🔥Highlighting the top ML papers every week.☆12,069Updated 3 months ago
- DSPy: The framework for programming—not prompting—language models☆29,874Updated last week
- LLM101n: Let's build a Storyteller☆35,520Updated last year
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆22,917Updated last year
- This repository contains demos I made with the Transformers library by HuggingFace.☆11,342Updated 4 months ago
- A simple screen parsing tool towards pure vision based GUI agent☆23,813Updated 2 months ago
- We write your reusable computer vision tools. 💜☆35,876Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆62,211Updated this week
- 🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.☆66,208Updated last week
- Awesome-LLM: a curated list of Large Language Model☆25,501Updated 3 months ago
- 🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!☆33,580Updated this week
- LLMs-from-scratch项目中文翻译☆1,942Updated 3 weeks ago
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆5,865Updated 3 weeks ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,857Updated 3 weeks ago
- Fully open reproduction of DeepSeek-R1☆25,629Updated 2 months ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆45,140Updated last week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,136Updated last year
- Convert PDF to markdown + JSON quickly with high accuracy☆29,799Updated last week