djc / instant-segmentLinks
Fast English word segmentation in Rust
☆99Updated 2 weeks ago
Alternatives and similar repositories for instant-segment
Users that are interested in instant-segment are comparing it to the libraries listed below
Sorting:
- Spelling correction & Fuzzy search based on Symmetric Delete spelling correction algorithm.☆137Updated last month
- Rust client for txtai☆111Updated last month
- finalfusion embeddings in Rust☆102Updated last year
- Small and fast library for k-means clustering.☆49Updated last week
- A Rust Vector which swaps to disk based on given parameters☆44Updated last year
- More efficient alternative to `serde_json::Value` which saves memory by interning primitive values and using tagged pointers.☆137Updated 7 months ago
- Lightweight FST-based autocompleter library written in Rust, targeting WebAssembly and data stored in-memory☆32Updated 2 years ago
- A flexible and convenient high-level mmap for zero-copy file I/O.☆115Updated 4 months ago
- Xor filters - efficient probabilistic hashsets. Faster and smaller than bloom and cuckoo filters.☆140Updated last year
- Simple string matching with single- and multiple-wildcard operator☆87Updated 9 months ago
- A rust implementation of some popular snowball stemming algorithms☆127Updated last year
- fastText Rust binding☆60Updated last year
- Fast approximate nearest neighbor searching in Rust, based on HNSW index☆329Updated last week
- Parallel iterator processing library for Rust☆103Updated last year
- Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)☆117Updated 10 months ago
- Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.☆78Updated last year
- Rust implementation of JMESPath, a query language for JSON☆139Updated last week
- Rust FFI wrapper for CRoaring☆158Updated 2 weeks ago
- Common stop words in a variety of languages☆21Updated 4 months ago
- This is a Rust implementation for popular caches (support no_std).☆111Updated 5 months ago
- A pure-Rust two-level dynamic b-tree. This crate implements a compact set data structure that preserves its elements' sorted order and a…☆98Updated 2 weeks ago
- Context-sensitive word embeddings with subwords. In Rust.☆87Updated last year
- Rust ULID (Universally Unique Lexicographically Sortable Identifier) generation and processing☆43Updated 6 months ago
- ☆66Updated 2 years ago
- String optimized for map keys☆65Updated 2 weeks ago
- ☆29Updated 2 years ago
- Fire-forged cluster management & Distributed data protocol☆77Updated 3 years ago
- Hidden Markov Models in Rust☆76Updated last year
- Rust wrapper for the BlingFire tokenization library☆15Updated 5 years ago
- Diff library with semantic cleanup, based on Google's diff-match-patch☆216Updated last month