rasbt / LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆84,736Updated last week
Alternatives and similar repositories for LLMs-from-scratch
Users that are interested in LLMs-from-scratch are comparing it to the libraries listed below
- 22AVP103 - Mastery Over Mind: 1st Sem B.Tech CSE (CYS) - ASC, CBE☆33Updated 2 weeks ago
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆74,834Updated this week
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆20,239Updated last month
- LLM101n: Let's build a Storyteller☆36,281Updated last year
- llama3 implementation one matrix multiplication at a time☆15,241Updated last year
- Awesome-LLM: a curated list of Large Language Model☆26,195Updated 6 months ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,190Updated 3 months ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆27,861Updated 4 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆51,625Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,137Updated this week
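- Build a large language model from scratch with only basic Python; build GLM4\Llama3\RWKV6 step by step from scratch and gain a deep understanding of how large models work☆3,939Updated last year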
- A high-throughput and memory-efficient inference and serving engine for LLMs☆69,622Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆67,023Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,293Updated last year
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆6,621Updated 3 weeks ago
- 🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.☆70,062Updated last week
- Explanation to key concepts in ML☆8,513Updated 7 months ago
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆20,587Updated this week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆23,054Updated this week
- DSPy: The framework for programming—not prompting—language models☆32,010Updated this week
- 🔥Highlighting the top ML papers every week.☆12,229Updated 6 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆31,421Updated last week
- A playbook for systematically maximizing the performance of deep learning models.☆29,774Updated last year
- Examples and guides for using the OpenAI API☆71,357Updated this week
- Understanding Deep Learning - Simon J.D. Prince☆9,051Updated 2 weeks ago
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆23,494Updated last year
- Machine Learning Engineering Open Book☆16,586Updated 2 weeks ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆30,705Updated last week
- Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.☆17,068Updated last year
- A natural language interface for computers☆62,041Updated 2 months ago