🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
☆1,549Mar 1, 2026Updated this week
Alternatives and similar repositories for voyager
Users that are interested in voyager are comparing it to the libraries listed below
Sorting:
- Header-only C++/python library for fast approximate nearest neighbors☆5,106Sep 14, 2025Updated 5 months ago
- Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, S…☆3,910Feb 22, 2026Updated last week
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk☆14,169Oct 29, 2025Updated 4 months ago
- just a bunch of useful embeddings for scikit-learn pipelines☆522Feb 12, 2026Updated 2 weeks ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,859May 17, 2025Updated 9 months ago
- Efficient few-shot learning with Sentence Transformers☆2,688Dec 11, 2025Updated 2 months ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,782Oct 14, 2025Updated 4 months ago
- Late Interaction Models Training & Retrieval☆732Updated this week
- Structured Outputs☆13,488Updated this week
- DSPy: The framework for programming—not prompting—language models☆32,381Feb 24, 2026Updated last week
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆1,500Feb 17, 2026Updated 2 weeks ago
- Fast State-of-the-Art Static Embeddings☆2,003Feb 13, 2026Updated 2 weeks ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,599Dec 20, 2025Updated 2 months ago
- Neural Search☆367Mar 11, 2025Updated 11 months ago
- Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.☆9,141Updated this week
- A library for efficient similarity search and clustering of dense vectors.☆39,195Feb 24, 2026Updated last week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,744Jan 9, 2026Updated last month
- 🤖 A PyTorch library of curated Transformer models and their composable components☆894Apr 17, 2024Updated last year
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc…☆15,690Updated this week
- Full text search that feels like a numpy array☆303Feb 1, 2026Updated last month
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,123Updated this week
- A blazing fast inference solution for text embeddings models☆4,525Updated this week
- Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search☆1,712Updated this week
- Open-source vector similarity search for Postgres☆19,981Updated this week
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆335Apr 25, 2025Updated 10 months ago
- Benchmarks of approximate nearest neighbor libraries in Python☆5,605Jun 10, 2025Updated 8 months ago
- structured outputs for llms☆12,428Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,875Feb 23, 2026Updated last week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,688Feb 5, 2026Updated 3 weeks ago
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…☆29,102Updated this week
- Extremely fast Query Engine for DataFrames, written in Rust☆37,582Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,247Updated this week
- Fast Multimodal Semantic Deduplication & Filtering☆890Jan 20, 2026Updated last month
- A guidance language for controlling large language models.☆21,327Feb 13, 2026Updated 2 weeks ago
- ⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍☆655Aug 7, 2025Updated 6 months ago
- Represent, send, store and search multimodal data☆3,115Jan 13, 2026Updated last month
- Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data …☆11,346Jan 13, 2026Updated last month
- an ambient intelligence library☆6,089Updated this week
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,865Updated this week