kuprel / minbpe-pytorchLinks
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA
☆38Updated last year
Alternatives and similar repositories for minbpe-pytorch
Users that are interested in minbpe-pytorch are comparing it to the libraries listed below
Sorting:
- Simple high-throughput inference library☆125Updated 2 months ago
- ☆130Updated last year
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- a small code base for training large models☆307Updated 2 months ago
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆190Updated last week
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆38Updated last year
- Inference of Mamba models in pure C☆189Updated last year
- 👷 Build compute kernels☆78Updated this week
- Implementation of mamba with rust☆88Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Python bindings for ggml☆143Updated 10 months ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- ☆35Updated 2 years ago
- NLP with Rust for Python 🦀🐍☆64Updated 2 months ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 3 months ago
- ☆20Updated 9 months ago
- ☆155Updated 2 years ago
- Because it's there.☆16Updated 10 months ago
- Full finetuning of large language models without large memory requirements☆94Updated last year
- Standalone commandline CLI tool for compiling Triton kernels☆17Updated 10 months ago
- Tree-based indexes for neural-search☆32Updated last year
- Alice in Wonderland code base for experiments and raw experiments data☆131Updated last month
- ☆63Updated 10 months ago
- Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers☆144Updated 7 months ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆11Updated last year
- ☆38Updated last year
- Lightweight tools for quick and easy LLM demo's☆28Updated 10 months ago
- Modular Rust transformer/LLM library using Candle☆36Updated last year