beowolx / rensa
High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets
☆217Updated 2 months ago
Alternatives and similar repositories for rensa
Users who are interested in rensa are comparing it to the libraries listed below
- Inference engine for GLiNER models, in Rust☆79Updated 2 weeks ago
- High-Performance Engine for Multi-Vector Search☆189Updated this week
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆112Updated 8 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆153Updated 4 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆195Updated last year
- Official Rust Implementation of Model2Vec☆142Updated 2 months ago
- Locality Sensitive Hashing☆76Updated 2 years ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆173Updated 7 months ago
- Faster structured generation☆262Updated last month
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆62Updated 7 months ago
- ☆135Updated last year
- NLP with Rust for Python 🦀🐍☆70Updated 6 months ago
- Embeddable library or single binary for indexing and searching 1B vectors☆337Updated this week
- A high-performance constrained decoding engine based on context free grammar in Rust☆56Updated 6 months ago
- ☆210Updated 5 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆139Updated last year
- Vector Database with support for late interaction and token level embeddings.☆54Updated 5 months ago
- Efficient vector database for hundreds of millions of embeddings.☆211Updated last year
- A complete (gRPC service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR…☆39Updated last year
- Formatron empowers everyone to control the format of language models' output with minimal overhead.☆231Updated 5 months ago
- Implement LLaVA using candle☆15Updated last year
- ☆86Updated 5 months ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆39Updated last month
- ☆140Updated last year
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆144Updated 8 months ago
- Contextualized per-token embeddings☆33Updated 6 months ago
- Lightweight Nearest Neighbors with Flexible Backends☆319Updated 2 months ago
- Pre-train Static Word Embeddings☆92Updated 2 months ago
- An implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- ☆136Updated last year