ytgui / PilotANN
Memory-Bounded GPU Acceleration for Vector Search
☆32 Updated 2 weeks ago
Alternatives and similar repositories for PilotANN
Users that are interested in PilotANN are comparing it to the libraries listed below
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al. ☆87 Updated 11 months ago
- DS SERVE: The Largest Open Vector Store over Pretraining Data; A Framework for Efficient and Scalable Neural Retrieval ☆37 Updated last month
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025) ☆30 Updated last year
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆66 Updated 2 years ago
- Graph Library for Approximate Similarity Search ☆138 Updated 4 months ago
- [VLDB 25] Maximum Inner Product is Query-Scaled Nearest Neighbor ☆35 Updated 2 months ago
- A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net ☆39 Updated last week
- Compression for Foundation Models ☆35 Updated 5 months ago
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024 ☆67 Updated 2 months ago
- A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such a… ☆176 Updated last week
- ☆26 Updated 4 months ago
- ☆14 Updated 11 months ago
- Official code for "Binary embedding based retrieval at Tencent" ☆44 Updated last year
- Large Scale Search Index ☆31 Updated 2 years ago
- ☆203 Updated last week
- Modular and structured prompt caching for low-latency LLM inference ☆110 Updated last year
- ☆27 Updated 9 months ago
- MSVBASE is a system that efficiently supports complex queries of both approximate similarity search and relational operators. It integrat… ☆102 Updated last year
- Bamboo-7B Large Language Model ☆93 Updated last year
- ☆24 Updated last year
- Scalable long-context LLM decoding that leverages sparsity by treating the KV cache as a vector storage system. ☆113 Updated 2 weeks ago
- Repository related to the Dynamic Exploration Graph and its previous iterations. ☆28 Updated last week
- ☆63 Updated 8 months ago
- A new query hardness measure for graph-based ANN indexes. Build unbiased workloads with this hardness to see the actual performance of yo… ☆22 Updated 11 months ago
- PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System] ☆48 Updated 2 months ago
- Collection of datasets for benchmarking filtered vector similarity retrieval ☆58 Updated 7 months ago
- ⚡ Faster similarity search with PDX: A vertical data layout for vectors ☆64 Updated 4 months ago
- state-of-the-art search over vector embeddings and structured data (SIGMOD '24) ☆99 Updated 10 months ago
- Block-based Approximate Nearest Neighbor ☆35 Updated 4 years ago
- Easy, Fast, and Scalable Multimodal AI ☆92 Updated this week