Rust-tokenizer offers high-performance tokenizers for modern language models, including WordPiece, Byte-Pair Encoding (BPE) and Unigram (SentencePiece) models
☆335Jan 22, 2026Updated last month
Alternatives and similar repositories for rust-tokenizers
Users that are interested in rust-tokenizers are comparing it to the libraries listed below
Sorting:
- Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)☆3,042Jan 13, 2026Updated last month
- Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)☆125Sep 17, 2024Updated last year
- Rust bindings for the C++ api of PyTorch.☆5,302Jan 22, 2026Updated last month
- Fast ML inference & training for ONNX models in Rust☆2,042Updated this week
- 🦀 Example of serving deep learning models in Rust with batched prediction☆34Mar 9, 2023Updated 2 years ago
- Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference☆2,803Updated this week
- Rust wrapper for Microsoft's ONNX Runtime (version 1.8)☆318Mar 6, 2024Updated 2 years ago
- Rust language bindings for Faiss☆249Nov 15, 2025Updated 3 months ago
- 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production☆10,497Feb 28, 2026Updated last week
- The most accurate natural language detection library for Rust, suitable for short text and mixed-language text☆1,061Updated this week
- finalfusion embeddings in Rust☆105Oct 10, 2023Updated 2 years ago
- fastText Rust binding☆65Jan 7, 2024Updated 2 years ago
- A Rust machine learning framework.☆4,563Feb 4, 2026Updated last month
- Rust implementation of the HNSW algorithm (Malkov-Yashunin)☆232Updated this week
- A rust interface to http://openml.org/☆12Jul 13, 2019Updated 6 years ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- Deep learning in Rust, with shape checked tensors and neural networks☆1,896Jul 23, 2024Updated last year
- Common stop words in a variety of languages☆25Feb 21, 2026Updated 2 weeks ago
- HNSW ANN from the paper "Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs"☆257Aug 8, 2025Updated 6 months ago
- Allows conversion between ndarray's types and image's types☆13Jun 26, 2021Updated 4 years ago
- Rust client for Qdrant vector search engine☆390Updated this week
- Context-sensitive word embeddings with subwords. In Rust.☆90Oct 20, 2023Updated 2 years ago
- Rust port of https://github.com/UKPLab/sentence-transformers☆31Apr 18, 2020Updated 5 years ago
- pure rust implemention of word2vec☆86May 8, 2023Updated 2 years ago
- Rust binding for the sentencepiece library☆25Jan 13, 2026Updated last month
- Rust client for txtai☆114Feb 25, 2026Updated last week
- Fast approximate nearest neighbor searching in Rust, based on HNSW index☆344Feb 10, 2026Updated 3 weeks ago
- ☆13Nov 4, 2023Updated 2 years ago
- Rust client for OpenAI API☆103Sep 13, 2023Updated 2 years ago
- Graph data structure library for Rust.☆3,773Feb 21, 2026Updated 2 weeks ago
- Minimalist ML framework for Rust☆19,509Feb 28, 2026Updated last week
- A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the web☆1,745Jul 21, 2024Updated last year
- Ready-made tokenizer library for working with GPT and tiktoken☆371Feb 18, 2026Updated 2 weeks ago
- [Unmaintained, see README] An ecosystem of Rust libraries for working with large language models☆6,150Jun 24, 2024Updated last year
- Pure Rust port of CRFsuite: a fast implementation of Conditional Random Fields (CRFs)☆30Updated this week
- An implementation of the diffusers api in Rust☆586Apr 4, 2024Updated last year
- ndarray: an N-dimensional array with array views, multidimensional slicing, and efficient operations☆4,223Feb 16, 2026Updated 2 weeks ago
- Tensors and differentiable operations (like TensorFlow) in Rust☆500Feb 11, 2023Updated 3 years ago
- An OpenAI-powered triage bot for a slack support channel designed to tag oncalls, prioritize issues, suggest solutions, and streamline co…☆12Jun 11, 2025Updated 8 months ago