djc / instant-segmentLinks
Fast English word segmentation in Rust
☆100Updated last month
Alternatives and similar repositories for instant-segment
Users that are interested in instant-segment are comparing it to the libraries listed below
Sorting:
- Spelling correction & Fuzzy search based on Symmetric Delete spelling correction algorithm.☆139Updated 3 months ago
- Small and fast library for k-means clustering.☆51Updated 3 months ago
- Rust client for txtai☆111Updated last month
- finalfusion embeddings in Rust☆103Updated 2 years ago
- Fast approximate nearest neighbor searching in Rust, based on HNSW index☆334Updated last month
- A rust implementation of some popular snowball stemming algorithms☆129Updated last year
- Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)☆121Updated last year
- A Rust Vector which swaps to disk based on given parameters☆44Updated last year
- Xor filters - efficient probabilistic hashsets. Faster and smaller than bloom and cuckoo filters.☆144Updated last month
- More efficient alternative to `serde_json::Value` which saves memory by interning primitive values and using tagged pointers.☆137Updated 10 months ago
- A simple and lightweight fuzzy search engine that works in memory, searching for similar strings (a pun here).☆179Updated last month
- A flexible and convenient high-level mmap for zero-copy file I/O.☆115Updated 7 months ago
- Locality Sensitive Hashing in Rust with Python bindings☆118Updated 2 years ago
- Rust implementation of JMESPath, a query language for JSON☆147Updated 3 months ago
- ☆67Updated 2 years ago
- Parallel iterator processing library for Rust☆103Updated 2 years ago
- Simple string matching with single- and multiple-wildcard operator☆88Updated 3 weeks ago
- Rust wrapper for the BlingFire tokenization library☆15Updated 5 years ago
- fastText Rust binding☆62Updated last year
- Common stop words in a variety of languages☆23Updated last month
- A simple wrapper around filesystem operations to provide more helpful error messages.☆152Updated this week
- A suite of non-cryptographic hash functions for Rust.☆139Updated 3 years ago
- Texting Robots: A Rust native `robots.txt` parser with thorough unit testing☆28Updated last year
- Context-sensitive word embeddings with subwords. In Rust.☆87Updated last year
- Native Rust port of Google's HighwayHash, which makes use of SIMD instructions for a fast and strong hash function☆172Updated last month
- Extract differences between arbitrary datastructures☆90Updated last year
- Multilingual implementation of RAKE algorithm for Rust☆34Updated 7 months ago
- Mix async code with CPU-heavy thread pools using Tokio + Rayon☆144Updated 2 years ago
- ☆81Updated 4 years ago
- A Web Dashboard for Lorikeet☆85Updated 2 years ago