guillaume-be / rust-tokenizers

Rust-tokenizer offers high-performance tokenizers for modern language models, including WordPiece, Byte-Pair Encoding (BPE) and Unigram (SentencePiece) models

☆323

Alternatives and similar repositories for rust-tokenizers

Users that are interested in rust-tokenizers are comparing it to the libraries listed below

Enet4 / faiss-rs
Rust language bindings for Faiss
☆225 Updated 5 months ago
cpcdoy / rust-sbert
Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)
☆118 Updated 10 months ago
huggingface / hf-hub
Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package
☆219 Updated last month
gaxler / llama2.rs
Inference Llama 2 in one file of pure Rust 🦀
☆233 Updated last year
jean-pierreBoth / hnswlib-rs
Rust implementation of the HNSW algorithm (Malkov-Yashunin)
☆200 Updated last month
nbigaouette / onnxruntime-rs
Rust wrapper for Microsoft's ONNX Runtime (version 1.8)
☆303 Updated last year
finalfusion / finalfusion-rust
finalfusion embeddings in Rust
☆102 Updated last year
messense / fasttext-rs
fastText Rust binding
☆61 Updated last year
EricLBuehler / candle-lora
Low rank adaptation (LoRA) for Candle.
☆152 Updated 3 months ago
rust-cv / hnsw
HNSW ANN from the paper "Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs"
☆237 Updated 6 months ago
ToluClassics / candle-tutorial
Tutorial for Porting PyTorch Transformer Models to Candle (Rust)
☆308 Updated last year
Gadersd / llama2-burn
Llama2 LLM ported to Rust burn
☆280 Updated last year
djc / instant-distance
Fast approximate nearest neighbor searching in Rust, based on HNSW index
☆330 Updated 3 weeks ago
santiagomed / orca
LLM Orchestrator built in Rust
☆281 Updated last year
mklf / word2vec-rs
Pure Rust implementation of word2vec
☆83 Updated 2 years ago
pgvector / pgvector-rust
pgvector support for Rust
☆175 Updated 2 months ago
tracel-ai / models
Models and examples built with Burn
☆263 Updated last month
ritchie46 / lsh-rs
Locality Sensitive Hashing in Rust with Python bindings
☆116 Updated 2 years ago
charles-r-earp / autograph
A machine learning library for Rust.
☆328 Updated 11 months ago
qdrant / rust-client
Rust client for Qdrant vector search engine
☆317 Updated 2 weeks ago
ssoudan / tch-m1
Example of tch-rs on M1
☆54 Updated last year
CurrySoftware / rust-stemmers
A Rust implementation of some popular Snowball stemming algorithms
☆127 Updated last year
e-tornike / best-of-ml-rust
🏆 A ranked list of awesome machine learning Rust libraries.
☆404 Updated last month
meilisearch / arroy
An Approximate Nearest Neighbors library in Rust, based on random projections and LMDB and optimized for memory usage
☆276 Updated 3 weeks ago
tensordot / syntaxdot
Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.
☆78 Updated last year
Anush008 / fastembed-rs
Rust library for generating vector embeddings, reranking.
☆562 Updated 3 weeks ago
Gadersd / whisper-burn
A Rust implementation of OpenAI's Whisper model using the burn framework
☆319 Updated last year
robertknight / rten
ONNX neural network inference engine
☆221 Updated this week
coreylowman / llama-dfdx
LLaMa 7b with CUDA acceleration implemented in Rust. Minimal GPU memory needed!
☆108 Updated 2 years ago
jeroenvlek / gpt-from-scratch-rs
Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle
☆73 Updated last year