spotify / voyager
π°οΈ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
β1,311Updated this week
Related projects β
Alternatives and complementary repositories for voyager
- Fast Open-Source Search & Clustering engine Γ for Vectors & π Strings Γ in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, Cβ¦β2,241Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embeddingβ1,497Updated this week
- A Python vector database you just need - no more, no less.β556Updated 8 months ago
- Collections of vector search related libraries, service and research papersβ1,407Updated 3 months ago
- AICI: Prompts as (Wasm) Programsβ1,931Updated last month
- Infinity is a high-throughput, low-latency REST API for serving text-embeddings, reranking models, clip, clap and colpaliβ1,434Updated this week
- Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!β4,685Updated this week
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)β3,053Updated 2 months ago
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vβ¦β3,932Updated this week
- Blazing fast framework for fine-tuning similarity learning modelsβ642Updated last month
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipyβ874Updated last week
- fast vector database made in numpyβ743Updated 6 months ago
- Llama 2 Everywhere (L2E)β1,511Updated 2 weeks ago
- Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.β1,750Updated this week
- A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search.β646Updated last month
- Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and π video, up to 5x faster thanβ¦β1,048Updated last month
- π€ A PyTorch library of curated Transformer models and their composable componentsβ865Updated 6 months ago
- PostgreSQL vector database extension for building AI applicationsβ779Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.β1,042Updated this week
- A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!)β772Updated last year
- π¦ Integrating LLMs into structured NLP pipelinesβ1,121Updated 3 months ago
- Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Searchβ1,128Updated this week
- Distributed data engine for Python/SQL designed for the cloud, powered by Rustβ2,312Updated this week
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store.β1,817Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMsβ2,183Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-β¦β3,036Updated 2 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.β834Updated 9 months ago
- Neural Searchβ344Updated 5 months ago
- A SQLite extension for efficient vector search, based on Faiss!β1,731Updated 6 months ago