spotify / voyager
🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
★1,319 · Updated last week
Related projects
Alternatives and complementary repositories for voyager
- Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C… ★2,264 · Updated this week
- Collections of vector search related libraries, service and research papers ★1,409 · Updated 3 months ago
- A SQLite extension for efficient vector search, based on Faiss! ★1,736 · Updated 6 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ★1,526 · Updated this week
- A Python vector database you just need - no more, no less. ★560 · Updated 8 months ago
- Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than… ★1,050 · Updated last month
- Automatically create Faiss knn indices with the most optimal similarity search parameters. ★817 · Updated 6 months ago
- Blazing fast framework for fine-tuning similarity learning models ★643 · Updated last month
- Things you can do with the token embeddings of an LLM ★1,376 · Updated last week
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23) ★3,072 · Updated this week
- Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search ★1,140 · Updated this week
- Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database. ★1,754 · Updated this week
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy ★899 · Updated last week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ★3,986 · Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ★1,095 · Updated last week
- Vald. A Highly Scalable Distributed Vector Search Engine ★1,537 · Updated this week
- Hierarchical Navigable Small World (HNSW) algorithm for vector similarity search in PostgreSQL ★566 · Updated 11 months ago
- A modern model graph visualizer and debugger ★1,058 · Updated this week
- A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search. ★648 · Updated last month
- Class notes for the course "Long Term Memory in AI - Vector Search and Databases" COS 597A @ Princeton Fall 2023 ★310 · Updated last year
- Nearest Neighbor Search with Neighborhood Graph and Tree for High-dimensional Data ★1,258 · Updated last week
- Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps! ★4,766 · Updated this week
- SPLADE: sparse neural search (SIGIR21, SIGIR22) ★780 · Updated 6 months ago
- A complement to pgvector for high performance, cost efficient vector search on large workloads. ★1,348 · Updated this week
- Examples of programs built using Modal ★730 · Updated this week
- A blazing fast inference solution for text embeddings models ★2,846 · Updated 2 weeks ago
- 🦙 Integrating LLMs into structured NLP pipelines ★1,136 · Updated 3 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ★3,057 · Updated 2 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ★1,473 · Updated this week
- A deep dive into embeddings starting from fundamentals ★965 · Updated this week