karpathy / LLM101n
LLM101n: Let's build a Storyteller
☆30,214Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for LLM101n
- Machine Learning Engineering Open Book☆11,655Updated last week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,195Updated 4 months ago
- llama3 implementation one matrix multiplication at a time☆13,741Updated 5 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆10,734Updated last week
- Video+code lecture on building nanoGPT from scratch☆3,611Updated 3 months ago
- Explanation to key concepts in ML☆7,321Updated this week
- Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory☆18,263Updated this week
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆33,080Updated this week
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆39,198Updated 3 months ago
- A one stop repository for generative AI research updates, interview resources, notebooks and much more!☆9,248Updated 2 weeks ago
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API☆10,520Updated 3 months ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆19,247Updated this week
- LLM training in simple, raw C/CUDA☆24,460Updated last month
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆3,371Updated last month
- 🔥Highlighting the top ML papers every week.☆10,302Updated this week
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information r…☆8,671Updated last week
- The n-gram Language Model☆1,342Updated 3 months ago
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.☆5,211Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆30,423Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)☆34,589Updated this week
- Neural Networks: Zero to Hero☆11,892Updated 3 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆37,411Updated 3 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆14,240Updated this week
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆2,281Updated last month
- DSPy: The framework for programming—not prompting—language models☆18,885Updated this week
- An autoregressive character-level language model for making more things☆2,607Updated 5 months ago
- Inference code for CodeLlama models☆16,044Updated 3 months ago
- Open-Sora: Democratizing Efficient Video Production for All☆22,289Updated this week
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.☆6,852Updated 3 months ago
- ☆8,482Updated last month