🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
☆1,571Mar 1, 2026Updated 2 months ago
Alternatives and similar repositories for voyager
Users that are interested in voyager are comparing it to the libraries listed below.
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk☆14,239Oct 29, 2025Updated 6 months ago
- Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, S…☆4,110May 2, 2026Updated 3 weeks ago
- Header-only C++/python library for fast approximate nearest neighbors☆5,236Mar 28, 2026Updated last month
- just a bunch of useful embeddings for scikit-learn pipelines☆526Feb 12, 2026Updated 3 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,924May 17, 2025Updated last year
- Efficient few-shot learning with Sentence Transformers☆2,741Apr 17, 2026Updated last month
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,866Oct 14, 2025Updated 7 months ago
- Structured Outputs☆13,891May 18, 2026Updated last week
- Late Interaction Models Training & Retrieval☆821Updated this week
- DSPy: The framework for programming—not prompting—language models☆34,631Updated this week
- Fast BM25 search in Python, powered by Numpy and Numba☆1,674May 18, 2026Updated last week
- A library for efficient similarity search and clustering of dense vectors.☆40,061May 15, 2026Updated last week
- Fast State-of-the-Art Static Embeddings☆2,071May 6, 2026Updated 2 weeks ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,615Dec 20, 2025Updated 5 months ago
- Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.☆10,349Updated this week
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆341Apr 25, 2025Updated last year
- 🤖 A PyTorch library of curated Transformer models and their composable components☆894Apr 17, 2024Updated 2 years ago
- A vector indexing library to bring fast, fresh and filtered search to your database☆1,804Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,973Updated this week
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc…☆16,220Updated this week
- Benchmarks of approximate nearest neighbor libraries in Python☆5,673Jun 10, 2025Updated 11 months ago
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,513Updated this week
- Open-source vector similarity search for Postgres☆21,378Apr 27, 2026Updated 3 weeks ago
- Neural Search☆371Mar 11, 2025Updated last year
- Full text search that feels like a numpy array☆310May 4, 2026Updated 3 weeks ago
- A blazing fast inference solution for text embeddings models☆4,808Apr 30, 2026Updated 3 weeks ago
- structured outputs for llms☆12,974May 17, 2026Updated last week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…☆31,401May 19, 2026Updated last week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,975Apr 27, 2026Updated 3 weeks ago
- Extremely fast Query Engine for DataFrames, written in Rust☆38,571Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,607Updated this week
- Automatically create Faiss knn indices with the most optimal similarity search parameters.☆902Nov 4, 2025Updated 6 months ago
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts)☆3,210May 13, 2026Updated last week
- A guidance language for controlling large language models.☆21,473Updated this week
- SPLADE: sparse neural search (SIGIR21, SIGIR22)☆994May 3, 2024Updated 2 years ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,804Mar 24, 2026Updated 2 months ago
- an ambient intelligence library☆6,156May 12, 2026Updated last week
- Fast Multimodal Semantic Deduplication & Filtering☆926May 4, 2026Updated 3 weeks ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,926Feb 24, 2024Updated 2 years ago