kuprel / minbpe-pytorchLinks
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA
☆41Updated last year
Alternatives and similar repositories for minbpe-pytorch
Users that are interested in minbpe-pytorch are comparing it to the libraries listed below
Sorting:
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆220Updated last week
- ☆135Updated last year
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆39Updated 2 years ago
- Simple high-throughput inference library☆151Updated 7 months ago
- Inference of Mamba models in pure C☆194Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.☆57Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- Python bindings for ggml☆146Updated last year
- Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers☆153Updated 11 months ago
- ☆35Updated 2 years ago
- ☆157Updated 2 years ago
- Efficient vector database for hundred millions of embeddings.☆211Updated last year
- NLP with Rust for Python 🦀🐍☆70Updated 7 months ago
- ☆19Updated last year
- Modular Rust transformer/LLM library using Candle☆37Updated last year
- Full finetuning of large language models without large memory requirements☆94Updated 2 months ago
- ☆138Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆58Updated last year
- a small code base for training large models☆315Updated 7 months ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆125Updated 7 months ago
- ☆39Updated 3 years ago
- Official Rust Implementation of Model2Vec☆143Updated 2 months ago
- RWKV in nanoGPT style☆196Updated last year
- Tree-based indexes for neural-search☆31Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- Because it's there.☆16Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml☆85Updated last year
- A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR…☆39Updated last year