rasbt / LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆48,954Updated 3 weeks ago
Alternatives and similar repositories for LLMs-from-scratch
Users that are interested in LLMs-from-scratch are comparing it to the libraries listed below
Sorting:
- llama3 implementation one matrix multiplication at a time☆14,934Updated 11 months ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆48,937Updated this week
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆50,140Updated 3 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,626Updated 10 months ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,300Updated this week
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆8,237Updated 3 weeks ago
- Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥☆38,553Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,097Updated last week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆41,189Updated 5 months ago
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆38,025Updated this week
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆2,968Updated 9 months ago
- 🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSe…☆60,368Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆47,245Updated this week
- Open-Sora: Democratizing Efficient Video Production for All☆26,401Updated 2 weeks ago
- Train transformer language models with reinforcement learning.☆13,703Updated this week
- Machine Learning Engineering Open Book☆13,689Updated last week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆21,830Updated 2 weeks ago
- 🙌 OpenHands: Code Less, Make More☆54,121Updated this week
- The official Meta Llama 3 GitHub site☆28,671Updated 3 months ago
- DSPy: The framework for programming—not prompting—language models☆24,189Updated this week
- A list of AI autonomous agents☆17,761Updated 2 months ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆25,078Updated last week
- LLM training in simple, raw C/CUDA☆26,563Updated this week
- LLM inference in C/C++☆79,738Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆14,188Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆94,231Updated this week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"☆11,921Updated 4 months ago
- 🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!☆20,678Updated 2 weeks ago
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆33,273Updated last week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆37,936Updated this week