kuprel / minbpe-pytorchLinks
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA
☆41Updated last year
Alternatives and similar repositories for minbpe-pytorch
Users that are interested in minbpe-pytorch are comparing it to the libraries listed below
Sorting:
- ☆135Updated last year
- Simple high-throughput inference library☆155Updated 8 months ago
- wasm bindings for huggingface tokenizers library☆34Updated 3 years ago
- NLP with Rust for Python 🦀🐍☆70Updated 8 months ago
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆225Updated last week
- ☆39Updated 3 years ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated 2 years ago
- RWKV in nanoGPT style☆197Updated last year
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆66Updated 2 years ago
- ☆35Updated 2 years ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- Because it's there.☆16Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆39Updated 2 years ago
- Inference of Mamba models in pure C☆196Updated last year
- Tree-based indexes for neural-search☆31Updated last year
- Python bindings for ggml☆146Updated last year
- Alice in Wonderland code base for experiments and raw experiments data☆131Updated 3 months ago
- ☆157Updated 2 years ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated 2 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago
- Experiments for efforts to train a new and improved t5☆76Updated last year
- Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers☆153Updated last year
- ☆138Updated last year
- implement llava using candle☆15Updated last year
- a small code base for training large models☆318Updated 8 months ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Updated 2 years ago
- Fast Text Classification with Compressors dictionary☆150Updated 2 years ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆55Updated 4 months ago