🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
☆1,554Mar 1, 2026Updated 3 weeks ago
Alternatives and similar repositories for voyager
Users that are interested in voyager are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk☆14,185Oct 29, 2025Updated 4 months ago
- Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, S…☆3,974Mar 2, 2026Updated 3 weeks ago
- Header-only C++/python library for fast approximate nearest neighbors☆5,126Updated this week
- just a bunch of useful embeddings for scikit-learn pipelines☆523Feb 12, 2026Updated last month
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,882May 17, 2025Updated 10 months ago
- Efficient few-shot learning with Sentence Transformers☆2,699Dec 11, 2025Updated 3 months ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,799Oct 14, 2025Updated 5 months ago
- Late Interaction Models Training & Retrieval☆754Mar 6, 2026Updated 2 weeks ago
- Structured Outputs☆13,588Updated this week
- DSPy: The framework for programming—not prompting—language models☆33,038Updated this week
- Fast lexical search implementing BM25 in Python☆1,589Mar 17, 2026Updated last week
- A library for efficient similarity search and clustering of dense vectors.☆39,403Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,605Dec 20, 2025Updated 3 months ago
- Fast State-of-the-Art Static Embeddings☆2,011Mar 12, 2026Updated last week
- Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.☆9,536Updated this week
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆336Apr 25, 2025Updated 10 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,791Mar 12, 2026Updated last week
- Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search☆1,725Updated this week
- 🤖 A PyTorch library of curated Transformer models and their composable components☆894Apr 17, 2024Updated last year
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc…☆15,834Updated this week
- Benchmarks of approximate nearest neighbor libraries in Python☆5,619Jun 10, 2025Updated 9 months ago
- Open-source vector similarity search for Postgres☆20,337Updated this week
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,187Updated this week
- Neural Search☆366Mar 11, 2025Updated last year
- Full text search that feels like a numpy array☆304Feb 1, 2026Updated last month
- A blazing fast inference solution for text embeddings models☆4,600Mar 13, 2026Updated last week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…☆29,611Updated this week
- structured outputs for llms☆12,551Mar 17, 2026Updated last week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,896Mar 16, 2026Updated last week
- Extremely fast Query Engine for DataFrames, written in Rust☆37,810Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,322Updated this week
- Automatically create Faiss knn indices with the most optimal similarity search parameters.☆900Nov 4, 2025Updated 4 months ago
- A guidance language for controlling large language models.☆21,356Updated this week
- SPLADE: sparse neural search (SIGIR21, SIGIR22)☆984May 3, 2024Updated last year
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,724Feb 5, 2026Updated last month
- an ambient intelligence library☆6,100Updated this week
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,961Updated this week
- Fast Multimodal Semantic Deduplication & Filtering☆897Jan 20, 2026Updated 2 months ago
- Collections of vector search related libraries, service and research papers☆1,552Aug 6, 2024Updated last year