kuprel / minbpe-pytorch

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA
35Updated 6 months ago

Related projects: