kuprel / minbpe-pytorch

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA
35 · Updated 8 months ago

Related projects

Alternatives and complementary repositories for minbpe-pytorch