karpathy / build-nanogptLinks
Video+code lecture on building nanoGPT from scratch
☆4,156Updated 10 months ago
Alternatives and similar repositories for build-nanogpt
Users that are interested in build-nanogpt are comparing it to the libraries listed below
Sorting:
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,700Updated 11 months ago
- nanoGPT style version of Llama 3.1☆1,380Updated 10 months ago
- NanoGPT (124M) in 3 minutes☆2,660Updated this week
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,993Updated 2 months ago
- PyTorch native post-training library☆5,273Updated this week
- ☆4,084Updated last year
- A PyTorch native platform for training generative AI models☆3,933Updated this week
- The n-gram Language Model☆1,424Updated 10 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,293Updated this week
- llama3 implementation one matrix multiplication at a time☆15,001Updated last year
- Tools for merging pretrained large language models.☆5,829Updated this week
- Train transformer language models with reinforcement learning.☆14,193Updated this week
- The official PyTorch implementation of Google's Gemma models☆5,485Updated 3 weeks ago
- Robust recipes to align language models with human and AI preferences☆5,232Updated last month
- Inference Llama 2 in one file of pure C☆18,475Updated 10 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆42,030Updated 6 months ago
- An autoregressive character-level language model for making more things☆3,128Updated last year
- AllenAI's post-training codebase☆3,018Updated this week
- LLM training in simple, raw C/CUDA☆26,906Updated last month
- Code for BLT research paper☆1,686Updated 3 weeks ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,548Updated 2 weeks ago
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API☆12,122Updated 10 months ago
- LLM101n: Let's build a Storyteller☆33,643Updated 10 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,417Updated 2 months ago
- ☆1,225Updated 3 months ago
- ☆2,965Updated 9 months ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,490Updated this week
- Modeling, training, eval, and inference code for OLMo☆5,689Updated last week
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆3,418Updated this week
- The hub for EleutherAI's work on interpretability and learning dynamics☆2,542Updated last week