🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
☆1,554 · Mar 1, 2026 · Updated last month
Alternatives and similar repositories for voyager
Users that are interested in voyager are comparing it to the libraries listed below.
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk ☆14,210 · Oct 29, 2025 · Updated 5 months ago
- Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, S… ☆4,028 · Updated this week
- Header-only C++/Python library for fast approximate nearest neighbors ☆5,165 · Mar 28, 2026 · Updated 2 weeks ago
- just a bunch of useful embeddings for scikit-learn pipelines ☆525 · Feb 12, 2026 · Updated 2 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,897 · May 17, 2025 · Updated 10 months ago
- Efficient few-shot learning with Sentence Transformers ☆2,710 · Apr 2, 2026 · Updated last week
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23) ☆3,822 · Oct 14, 2025 · Updated 6 months ago
- Structured Outputs ☆13,657 · Mar 26, 2026 · Updated 2 weeks ago
- Late Interaction Models Training & Retrieval ☆783 · Mar 6, 2026 · Updated last month
- DSPy: The framework for programming—not prompting—language models ☆33,649 · Updated this week
- Fast BM25 search in Python, powered by Numpy and Numba ☆1,615 · Apr 5, 2026 · Updated last week
- A library for efficient similarity search and clustering of dense vectors. ☆39,720 · Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,605 · Dec 20, 2025 · Updated 3 months ago
- Fast State-of-the-Art Static Embeddings ☆2,024 · Updated this week
- Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less. ☆9,828 · Apr 8, 2026 · Updated last week
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy ☆338 · Apr 25, 2025 · Updated 11 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ☆2,853 · Updated this week
- Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search ☆1,744 · Updated this week
- 🤖 A PyTorch library of curated Transformer models and their composable components ☆895 · Apr 17, 2024 · Updated last year
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc… ☆15,978 · Updated this week
- Benchmarks of approximate nearest neighbor libraries in Python ☆5,641 · Jun 10, 2025 · Updated 10 months ago
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve… ☆6,310 · Updated this week
- Open-source vector similarity search for Postgres ☆20,666 · Mar 17, 2026 · Updated 3 weeks ago
- Neural Search ☆369 · Mar 11, 2025 · Updated last year
- Full text search that feels like a numpy array ☆308 · Feb 1, 2026 · Updated 2 months ago
- A blazing fast inference solution for text embeddings models ☆4,663 · Apr 7, 2026 · Updated last week
- structured outputs for llms ☆12,749 · Updated this week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl… ☆30,313 · Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,931 · Updated this week
- Extremely fast Query Engine for DataFrames, written in Rust ☆38,112 · Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows ☆12,395 · Apr 8, 2026 · Updated last week
- Automatically create Faiss knn indices with the most optimal similarity search parameters. ☆901 · Nov 4, 2025 · Updated 5 months ago
- A guidance language for controlling large language models. ☆21,381 · Updated this week
- SPLADE: sparse neural search (SIGIR21, SIGIR22) ☆990 · May 3, 2024 · Updated last year
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ☆2,752 · Mar 24, 2026 · Updated 3 weeks ago
- an ambient intelligence library ☆6,127 · Updated this week
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) ☆3,060 · Mar 31, 2026 · Updated 2 weeks ago
- Fast Multimodal Semantic Deduplication & Filtering ☆910 · Jan 20, 2026 · Updated 2 months ago
- Collections of vector search related libraries, service and research papers ☆1,555 · Aug 6, 2024 · Updated last year