rasbt / LLMs-from-scratchLinks
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆57,201Updated this week
Alternatives and similar repositories for LLMs-from-scratch
Users that are interested in LLMs-from-scratch are comparing it to the libraries listed below
Sorting:
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆56,441Updated 3 weeks ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆10,859Updated 3 weeks ago
- LLM101n: Let's build a Storyteller☆33,844Updated 11 months ago
- llama3 implementation one matrix multiplication at a time☆15,017Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,380Updated this week
- Awesome-LLM: a curated list of Large Language Model☆23,985Updated last month
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆3,267Updated 10 months ago
- Open-Sora: Democratizing Efficient Video Production for All☆26,782Updated 2 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,719Updated last year
- Machine Learning Engineering Open Book☆14,126Updated this week
- A one stop repository for generative AI research updates, interview resources, notebooks and much more!☆12,986Updated 3 weeks ago
- 🐙 Guides, papers, lecture, notebooks and resources for prompt engineering☆58,612Updated last week
- A list of AI autonomous agents☆19,061Updated 4 months ago
- Simple, unified interface to multiple Generative AI providers☆12,200Updated last month
- 21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/☆89,273Updated this week
- 3D Visualization of an GPT-style LLM☆4,755Updated 10 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆50,864Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆20,003Updated 3 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.☆41,413Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆42,633Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆42,415Updated 6 months ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆53,115Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆26,164Updated last week
- An open-source RAG-based tool for chatting with your documents.☆22,715Updated 3 weeks ago
- 🔥Highlighting the top ML papers every week.☆11,526Updated 3 weeks ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆24,658Updated this week
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.☆6,757Updated 2 weeks ago
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.☆7,756Updated 11 months ago
- 🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.☆18,405Updated 2 months ago
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆4,726Updated 3 weeks ago