beowolx / rensaLinks
High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets
☆230Updated last month
Alternatives and similar repositories for rensa
Users that are interested in rensa are comparing it to the libraries listed below
Sorting:
- Inference engine for GLiNER models, in Rust☆90Updated last month
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆114Updated 11 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆155Updated 6 months ago
- Official Rust Implementation of Model2Vec☆152Updated this week
- Faster structured generation☆275Updated 2 weeks ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆203Updated last year
- High-Performance Engine for Multi-Vector Search☆207Updated 3 weeks ago
- Locality Sensitive Hashing☆78Updated 2 years ago
- ☆210Updated 7 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆181Updated 9 months ago
- ☆135Updated last year
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆66Updated 9 months ago
- Embeddable library or single binary for indexing and searching 1B vectors☆366Updated last month
- Efficient vector database for hundred millions of embeddings.☆211Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆138Updated last year
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆42Updated 4 months ago
- NLP with Rust for Python 🦀🐍☆71Updated 8 months ago
- ☆73Updated last month
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆146Updated 10 months ago
- Formatron empowers everyone to control the format of language models' output with minimal overhead.☆234Updated 8 months ago
- A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR…☆39Updated last year
- PyLate efficient inference engine☆71Updated last month
- ☆140Updated last year
- A high-performance constrained decoding engine based on context free grammar in Rust☆58Updated 8 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated 3 months ago
- Vector Database with support for late interaction and token level embeddings.☆54Updated 7 months ago
- Contextualized per-token embeddings☆34Updated 8 months ago
- Lightweight Nearest Neighbors with Flexible Backends☆333Updated last month
- ☆91Updated 7 months ago
- Simple UI for debugging correlations of text embeddings☆305Updated 8 months ago