🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
☆1,578Mar 1, 2026Updated 4 months ago
Alternatives and similar repositories for voyager
Users that are interested in voyager are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk☆14,256Oct 29, 2025Updated 8 months ago
- Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, S…☆4,196May 28, 2026Updated last month
- Header-only C++/python library for fast approximate nearest neighbors☆5,261Mar 28, 2026Updated 3 months ago
- just a bunch of useful embeddings for scikit-learn pipelines☆527Feb 12, 2026Updated 4 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,938May 17, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Efficient few-shot learning with Sentence Transformers☆2,761May 26, 2026Updated last month
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,891Oct 14, 2025Updated 8 months ago
- Structured Outputs☆14,273Updated this week
- Late Interaction Models Training & Retrieval☆859Jun 25, 2026Updated last week
- DSPy: The framework for programming—not prompting—language models☆35,605Jun 25, 2026Updated last week
- Fast BM25 search in Python, powered by Numpy and Numba☆1,724Jun 11, 2026Updated 3 weeks ago
- A library for efficient similarity search and clustering of dense vectors.☆40,426Updated this week
- Fast State-of-the-Art Static Embeddings☆2,138Jun 6, 2026Updated 3 weeks ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,622Dec 20, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.☆10,768Updated this week
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆342Apr 25, 2025Updated last year
- 🤖 A PyTorch library of curated Transformer models and their composable components☆895Apr 17, 2024Updated 2 years ago
- A vector indexing library to bring fast, fresh and filtered search to your database☆1,863Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆3,058Jun 23, 2026Updated last week
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc…☆16,487Updated this week
- Benchmarks of approximate nearest neighbor libraries in Python☆5,694Jun 10, 2025Updated last year
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,727Jun 27, 2026Updated last week
- Open-source vector similarity search for Postgres☆22,007Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Neural Search☆371Mar 11, 2025Updated last year
- A blazing fast inference solution for text embeddings models☆4,908Jun 22, 2026Updated last week
- Full text search that feels like a numpy array☆311May 4, 2026Updated 2 months ago
- structured outputs for llms☆13,328Updated this week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…☆32,812Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆5,022Updated this week
- Extremely fast Query Engine for DataFrames, written in Rust☆38,879Jun 26, 2026Updated last week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,683Jun 22, 2026Updated last week
- Automatically create Faiss knn indices with the most optimal similarity search parameters.☆904Nov 4, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A guidance language for controlling large language models.☆21,519May 21, 2026Updated last month
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts)☆3,341Jun 16, 2026Updated 2 weeks ago
- SPLADE: sparse neural search (SIGIR21, SIGIR22)☆996May 3, 2024Updated 2 years ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,857Mar 24, 2026Updated 3 months ago
- an ambient intelligence library☆6,174May 12, 2026Updated last month
- Fast Multimodal Semantic Deduplication & Filtering☆940May 24, 2026Updated last month
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,930Feb 24, 2024Updated 2 years ago