spotify / voyager
🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
☆1,388 · Updated 2 weeks ago
Alternatives and similar repositories for voyager:
Users who are interested in voyager are comparing it to the libraries listed below.
- Fast Open-Source Search & Clustering engine × for Vectors & Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C… ☆2,477 · Updated 2 weeks ago
- Collections of vector search related libraries, service and research papers ☆1,457 · Updated 6 months ago
- A Python vector database you just need - no more, no less. ☆592 · Updated 11 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ☆1,787 · Updated this week
- A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!) ☆770 · Updated last year
- fast vector database made in numpy ☆751 · Updated 9 months ago
- A SQLite extension for efficient vector search, based on Faiss! ☆1,803 · Updated 9 months ago
- Blazing fast framework for fine-tuning similarity learning models ☆649 · Updated last month
- Things you can do with the token embeddings of an LLM ☆1,424 · Updated 2 weeks ago
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v… ☆4,191 · Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,332 · Updated this week
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy ☆1,019 · Updated last month
- Weave is a toolkit for developing AI-powered applications, built by Weights & Biases. ☆822 · Updated this week
- A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search. ☆680 · Updated 4 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,295 · Updated last week
- A deep dive into embeddings starting from fundamentals ☆995 · Updated 3 months ago
- A Simple Bulk Labelling Tool ☆566 · Updated last month
- just a bunch of useful embeddings for scikit-learn pipelines ☆480 · Updated last month
- The simplest way to serve AI/ML models in production ☆951 · Updated this week
- A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap. ☆1,386 · Updated 2 weeks ago
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,582 · Updated 11 months ago
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store. ☆1,846 · Updated this week
- Creative interactive views of any dataset. ☆835 · Updated last month
- Fast State-of-the-Art Static Embeddings ☆1,070 · Updated this week
- ⚡️ A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion ☆519 · Updated 7 months ago
- PostgreSQL vector database extension for building AI applications ☆827 · Updated 2 months ago
- A system for agentic LLM-powered data processing and ETL ☆1,677 · Updated this week
- skops is a Python library helping you share your scikit-learn based models and put them in production ☆466 · Updated this week
- Python client for Qdrant vector search engine ☆870 · Updated this week
- Fast Semantic Text Deduplication ☆532 · Updated this week