ytgui / PilotANN
Memory-Bounded GPU Acceleration for Vector Search
☆32Updated 2 months ago
Alternatives and similar repositories for PilotANN
Users that are interested in PilotANN are comparing it to the libraries listed below
- DS SERVE: The Largest Open Vector Store over Pretraining Data; A Framework for Efficient and Scalable Neural Retrieval☆29Updated 2 weeks ago
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆29Updated last year
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al.☆85Updated 11 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆66Updated 2 years ago
- [VLDB 25] Maximum Inner Product is Query-Scaled Nearest Neighbor☆34Updated last month
- Compression for Foundation Models☆34Updated 5 months ago
- A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such a…☆174Updated last week
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024☆66Updated 2 months ago
- Graph Library for Approximate Similarity Search☆136Updated 3 months ago
- Collection of datasets for benchmarking filtered vector similarity retrieval☆58Updated 6 months ago
- ☆26Updated 3 months ago
- ☆24Updated last year
- Repository related to the Dynamic Exploration Graph and its previous iterations.☆28Updated this week
- Large Scale Search Index☆31Updated 2 years ago
- ☆47Updated 8 months ago
- A new query hardness measure for graph-based ANN indexes. Build unbiased workloads with this hardness to see the actual performance of yo…☆22Updated 10 months ago
- PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System]☆48Updated 2 months ago
- ☆63Updated 7 months ago
- ☆12Updated last year
- ☆13Updated 11 months ago
- Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.☆108Updated 3 months ago
- ☆198Updated this week
- QJL: 1-Bit Quantized JL transform for KV Cache Quantization with Zero Overhead☆31Updated 11 months ago
- state-of-the-art search over vector embeddings and structured data (SIGMOD '24)☆95Updated 9 months ago
- A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net☆39Updated 5 months ago
- Algorithms for approximate nearest neighbor search with window filters☆45Updated last year
- ☆34Updated 10 months ago
- MSVBASE is a system that efficiently supports complex queries of both approximate similarity search and relational operators. It integrat…☆101Updated last year
- Easy, Fast, and Scalable Multimodal AI☆81Updated last week
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆30Updated 8 months ago