karpathy/rustbpe

The missing tiktoken training code

☆502

Alternatives and similar repositories for rustbpe

Users who are interested in rustbpe are comparing it to the libraries listed below.

karpathy / hn-time-capsule
Analyzing Hacker News discussions from a decade ago in hindsight with LLMs
☆668 · Dec 10, 2025 · Updated 7 months ago
1rgs / nanocode
Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.
☆2,524 · Jan 14, 2026 · Updated 6 months ago
karpathy / nanochat
The best ChatGPT that $100 can buy.
☆56,732 · Jul 4, 2026 · Updated 3 weeks ago
karpathy / rendergit
Render any git repo into a single static HTML page for humans or LLMs
☆2,416 · Aug 21, 2025 · Updated 11 months ago
karpathy / reader3
Quick illustration of how one can easily read books together with LLMs. It's great and I highly recommend it.
☆3,806 · Nov 18, 2025 · Updated 8 months ago
huggingface / picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
☆2,260 · Aug 26, 2025 · Updated 11 months ago
karpathy / transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
☆250 · Jun 27, 2022 · Updated 4 years ago
karpathy / calorie
nice and effective super simple calorie counter web app
☆160 · May 30, 2024 · Updated 2 years ago
KellerJordan / modded-nanogpt
NanoGPT (124M) in 90 seconds
☆5,600 · Updated this week
karpathy / sqlitedict
Persistent dict, backed by sqlite3 and pickle, multithread-safe.
☆49 · Feb 21, 2020 · Updated 6 years ago
changjonathanc / flex-nano-vllm
FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.
☆356 · Nov 2, 2025 · Updated 8 months ago
karpathy / researchpooler
Automating research publications discovery and analysis. For example, ever wish your computer could automatically open papers that are mo…
☆492 · Sep 1, 2023 · Updated 2 years ago
karpathy / micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
☆16,889 · Aug 8, 2024 · Updated last year
karpathy / notpygamejs
Game making library for using Canvas element
☆109 · Oct 17, 2023 · Updated 2 years ago
huggingface / nanoVLM
The simplest, fastest repository for training/finetuning small-sized VLMs.
☆4,969 · Oct 27, 2025 · Updated 9 months ago
karpathy / nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆61,632 · Nov 12, 2025 · Updated 8 months ago
PrimeIntellect-ai / verifiers
Our library for RL environments + evals
☆4,410 · Updated this week
karpathy / optim
A numeric optimization package for Torch.
☆42 · Aug 19, 2021 · Updated 4 years ago
karpathy / minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
☆10,647 · Jul 1, 2024 · Updated 2 years ago
meta-pytorch / monarch
PyTorch Single Controller
☆1,065 · Updated this week
openai / gpt-oss
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
☆20,266 · Updated this week
karpathy / karpathy.github.io
my blog
☆1,825 · Apr 10, 2026 · Updated 3 months ago
karpathy / llm.c
LLM training in simple, raw C/CUDA
☆30,663 · Jun 26, 2025 · Updated last year
taidopurason / tokenizer-extension
☆15 · Dec 4, 2025 · Updated 7 months ago
karpathy / scholaroctopus
A set of tools/pages that help explore academic literature
☆88 · Aug 11, 2014 · Updated 11 years ago
karpathy / autoresearch
AI agents running research on single-GPU nanochat training automatically
☆92,224 · Mar 26, 2026 · Updated 4 months ago
karpathy / llm-council
LLM Council works together to answer your hardest questions
☆23,300 · Nov 22, 2025 · Updated 8 months ago
pytorch / torchtitan
A PyTorch native platform for training generative AI models
☆5,568 · Updated this week
lucidrains / fast-weight-product-key-memory
Implementation of the fast weight product key memory from Sakana AI
☆19 · Apr 1, 2026 · Updated 3 months ago
linkedin / Liger-Kernel
Efficient Triton Kernels for LLM Training
☆6,537 · Updated this week
clu0 / unet.cu
UNet diffusion model in pure CUDA
☆661 · Jun 28, 2024 · Updated 2 years ago
ash-01xor / bpe.c
Simple byte pair encoding mechanism used for the tokenization process, written purely in C.
☆151 · Nov 11, 2024 · Updated last year
hiverge / cifar10-speedrun
CIFAR-10 speedrun: Trains to 94% accuracy in 1.98 seconds on a single NVIDIA A100 GPU.
☆79 · Oct 17, 2025 · Updated 9 months ago
policy-gradient / GRPO-Zero
Implementing DeepSeek R1's GRPO algorithm from scratch
☆1,883 · Apr 18, 2025 · Updated last year
Quentin-Anthony / nanoMPI
Simple MPI implementation for prototyping or learning
☆325 · Aug 6, 2025 · Updated 11 months ago
huggingface / nanotron
Minimalistic large language model 3D-parallelism training
☆2,768 · May 26, 2026 · Updated 2 months ago
deepseek-ai / Engram
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
☆4,568 · Jan 14, 2026 · Updated 6 months ago
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆340 · Mar 31, 2026 · Updated 3 months ago
PrimeIntellect-ai / prime-rl
Agentic RL Training at Scale
☆1,759 · Updated this week