pisa-engine / kentro
High-Performance K-Means Clustering Library
★39 · Updated 5 months ago
Alternatives and similar repositories for kentro
Users who are interested in kentro are comparing it to the libraries listed below.
- Inference engine for GLiNER models, in Rust · ★81 · Updated last month
- NLP with Rust for Python 🦀🐍 · ★70 · Updated 7 months ago
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas… · ★223 · Updated 2 weeks ago
- Official Rust Implementation of Model2Vec · ★143 · Updated 2 months ago
- Efficient BM25 with DuckDB 🦆 · ★59 · Updated last year
- Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024. · ★32 · Updated 2 weeks ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and… · ★39 · Updated 2 months ago
- A text embedding extension for the Polars Dataframe library. · ★27 · Updated last year
- Official software repository of L. Delfino, D. Erriquez, S. Martinico, F. M. Nardini, C. Rulli, and R. Venturini. "kANNolo: Sweet and Smo… · ★44 · Updated last month
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024 · ★66 · Updated last month
- Tree-based indexes for neural-search · ★31 · Updated last year
- Modular Rust transformer/LLM library using Candle · ★37 · Updated last year
- open source tooling for AI search and understanding · ★51 · Updated 2 years ago
- Comparing performance-oriented string-processing libraries for substring search, multi-pattern matching, hashing, edit-distances, sketchi… · ★134 · Updated this week
- Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini. "Efficient Inverted Indexes for Approximate Retrieva… · ★100 · Updated last month
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al. · ★85 · Updated 10 months ago
- ★89 · Updated 5 months ago
- build your own vector database -- the littlest hnsw · ★67 · Updated 11 months ago
- Contextualized per-token embeddings · ★34 · Updated 7 months ago
- High-Performance Engine for Multi-Vector Search · ★193 · Updated 2 weeks ago
- Locality Sensitive Hashing · ★77 · Updated 2 years ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" · ★65 · Updated 2 years ago
- ★13 · Updated last week
- A collection of optimisers for use with candle · ★44 · Updated last week
- Real-time data processing/feature engineering in Rust and Python. Tailored for modern AI/ML systems. · ★73 · Updated this week
- Rust crate for some audio utilities · ★25 · Updated 9 months ago
- Embeddable library or single binary for indexing and searching 1B vectors · ★340 · Updated last week
- ★135 · Updated last year
- Tantivy directory implementation backed by object_store · ★37 · Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. · ★174 · Updated 7 months ago