kuprel / minbpe-pytorch

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA

☆38

Alternatives and similar repositories for minbpe-pytorch

Users who are interested in minbpe-pytorch are comparing it to the libraries listed below.

facebookresearch / fastgen
Simple high-throughput inference library
☆125 Updated 2 months ago
LaurentMazare / mamba.rs
☆130 Updated last year
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13 Updated last year
Cerebras / gigaGPT
a small code base for training large models
☆307 Updated 2 months ago
beowolx / rensa
High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…
☆190 Updated last week
leo-du / llama2.rs
Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust
☆38 Updated last year
kroggen / mamba.c
Inference of Mamba models in pure C
☆189 Updated last year
huggingface / kernel-builder
👷 Build compute kernels
☆78 Updated this week
flawedmatrix / mamba-ssm
Implementation of mamba with rust
☆88 Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119 Updated last year
geov-ai / geov
The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121 Updated 2 years ago
abetlen / ggml-python
Python bindings for ggml
☆143 Updated 10 months ago
kir-gadjello / zipslicer
A library for incremental loading of large PyTorch checkpoints
☆56 Updated 2 years ago
FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆66 Updated last year
abetlen / program-constrained-language-model-sampling
☆35 Updated 2 years ago
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆64 Updated 2 months ago
valine / training-hot-swap
PyTorch script hot swap: Change code without unloading your LLM from VRAM
☆126 Updated 3 months ago
LaurentMazare / glim
☆20 Updated 9 months ago
Narsil / fast_gpt2
☆155 Updated 2 years ago
charlesfrye / cuda-substrings
Because it's there.
☆16 Updated 10 months ago
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94 Updated last year
mikex86 / tritonc
Standalone commandline CLI tool for compiling Triton kernels
☆17 Updated 10 months ago
raphaelsty / neural-tree
Tree-based indexes for neural-search
☆32 Updated last year
LAION-AI / AIW
Alice in Wonderland code base for experiments and raw experiments data
☆131 Updated last month
CERC-AAI / Robin
☆63 Updated 10 months ago
fabiocannizzo / FastBinarySearch
Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers
☆144 Updated 7 months ago
Chillee / lit-llama
Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code
☆11 Updated last year
xjdr-alt / muzero_sketch
☆38 Updated last year
allenai / adapt-demos
Lightweight tools for quick and easy LLM demos
☆28 Updated 10 months ago
oxidized-transformers / oxidized-transformers
Modular Rust transformer/LLM library using Candle
☆36 Updated last year