beowolx / rensaLinks

High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets

☆192

Alternatives and similar repositories for rensa

Users that are interested in rensa are comparing it to the libraries listed below

Sorting:

fbilhaut / gline-rs
Inference engine for GLiNER models, in Rust
☆64Updated 3 weeks ago
Oxen-AI / GRPO-With-Cargo-Feedback
This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback
☆100Updated 4 months ago
guidance-ai / llgtrt
TensorRT-LLM server with Structured Outputs (JSON) built with Rust
☆56Updated 3 months ago
mixedbread-ai / batched
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆142Updated 2 weeks ago
dottxt-ai / outlines-core
Faster structured generation
☆237Updated 2 months ago
mixedbread-ai / baguetter
Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…
☆186Updated 11 months ago
serega / gaoya
Locality Sensitive Hashing
☆72Updated 2 years ago
lightonai / fast-plaid
High-Performance Engine for Multi-Vector Search
☆130Updated last month
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆64Updated 2 months ago
Dan-wanna-M / kbnf
A high-performance constrained decoding engine based on context free grammar in Rust
☆54Updated 2 months ago
LaurentMazare / mamba.rs
☆130Updated last year
jlscheerer / xtr-warp
XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.
☆150Updated 2 months ago
guidance-ai / llguidance
Super-fast Structured Outputs
☆342Updated last week
MinishLab / model2vec-rs
Official Rust Implementation of Model2Vec
☆122Updated 3 weeks ago
cohere-ai / BinaryVectorDB
Efficient vector database for hundred millions of embeddings.
☆207Updated last year
Dan-wanna-M / formatron
Formatron empowers everyone to control the format of language models' output with minimal overhead.
☆220Updated last month
yaman / fashion-clip-rs
A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR…
☆39Updated 11 months ago
cohere-ai / DiskVectorIndex
☆210Updated last month
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆137Updated last year
MinishLab / tokenlearn
Pre-train Static Word Embeddings
☆84Updated 2 months ago
tomsanbear / candle-einops
☆31Updated 8 months ago
DeployQL / LintDB
Vector Database with support for late interaction and token level embeddings.
☆55Updated last month
Vaibhavs10 / fast-llm.rs
☆138Updated last year
AnswerDotAI / fastkmeans
☆63Updated 3 weeks ago
oxidized-transformers / oxidized-transformers
Modular Rust transformer/LLM library using Candle
☆36Updated last year
chenwanqq / candle-llava
implement llava using candle
☆15Updated last year
google / unisim
UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.
☆139Updated 3 months ago
jina-ai / correlations
Simple UI for debugging correlations of text embeddings
☆288Updated 2 months ago
QuixiAI / spectrum
☆128Updated 3 months ago
lightonai / pylate
Late Interaction Models Training & Retrieval
☆511Updated 2 weeks ago