Lightweight Nearest Neighbors with Flexible Backends
☆334Dec 30, 2025Updated 2 months ago
Alternatives and similar repositories for vicinity
Users that are interested in vicinity are comparing it to the libraries listed below
Sorting:
- Fast Multimodal Semantic Deduplication & Filtering☆890Jan 20, 2026Updated last month
- Fast State-of-the-Art Static Embeddings☆2,003Feb 13, 2026Updated 2 weeks ago
- Late Interaction Models Training & Retrieval☆721Feb 18, 2026Updated last week
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Jun 30, 2025Updated 8 months ago
- Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.☆20Jun 26, 2024Updated last year
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 9 months ago
- Official Rust Implementation of Model2Vec☆160Feb 5, 2026Updated 3 weeks ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆254Jun 11, 2025Updated 8 months ago
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,837Updated this week
- ☆91Jul 4, 2025Updated 7 months ago
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- Inference engine for GLiNER models, in Rust☆98Jan 10, 2026Updated last month
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,597Dec 20, 2025Updated 2 months ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆1,500Feb 17, 2026Updated last week
- Efficient few-shot learning with Sentence Transformers☆2,683Dec 11, 2025Updated 2 months ago
- ☆21Oct 14, 2024Updated last year
- just a bunch of useful embeddings for scikit-learn pipelines☆522Feb 12, 2026Updated 2 weeks ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆78Feb 10, 2026Updated 2 weeks ago
- Embedding Vector Oriented Clustering☆173Feb 4, 2026Updated 3 weeks ago
- Gather module dependencies of source code☆13Jul 21, 2023Updated 2 years ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,903Updated this week
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆206Aug 31, 2024Updated last year
- Code for our paper accepted at EMNLP 2023 (Findings)☆14Jan 5, 2024Updated 2 years ago
- Multi-model transactional embedded database☆68Dec 10, 2024Updated last year
- Things you can do with the token embeddings of an LLM☆1,453Dec 1, 2025Updated 2 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,733Jan 9, 2026Updated last month
- Structured Outputs☆13,456Feb 13, 2026Updated 2 weeks ago
- A Python library for calculating a large variety of metrics from text☆360Jan 30, 2026Updated last month
- An advanced automation framework for audio mixer consoles, OBS, PTZ cameras and more based on the Open Sound Control protocol.☆113Nov 13, 2023Updated 2 years ago
- Generalist and Lightweight Model for Text Classification☆193Feb 17, 2026Updated last week
- Robust and fast topic models with sentence-transformers.☆94Feb 3, 2026Updated 3 weeks ago
- ai for jq☆249Sep 20, 2024Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,859May 17, 2025Updated 9 months ago
- Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFs☆30Jan 20, 2025Updated last year
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- A fast multi-core implementation of HDBSCAN for low dimensional Euclidean spaces☆132Feb 4, 2026Updated 3 weeks ago
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.☆26Jun 9, 2025Updated 8 months ago
- PageRank for LLMs☆52Sep 10, 2025Updated 5 months ago
- 🔢 Work with static vector models☆37Apr 21, 2025Updated 10 months ago