ytgui / PilotANNLinks
Memory-Bounded GPU Acceleration for Vector Search
☆25Updated 2 months ago
Alternatives and similar repositories for PilotANN
Users that are interested in PilotANN are comparing it to the libraries listed below
Sorting:
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆64Updated last year
- A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net☆27Updated last month
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024☆62Updated 8 months ago
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al.☆80Updated 5 months ago
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆21Updated last year
- Graph Library for Approximate Similarity Search☆121Updated 2 weeks ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆47Updated 3 months ago
- ☆17Updated 3 weeks ago
- Compression for Foundation Models☆32Updated 3 months ago
- state-of-the-art search over vector embeddings and structured data (SIGMOD '24)☆79Updated 3 months ago
- ☆29Updated 5 months ago
- Official software repository of L. Delfino, D. Erriquez, S. Martinico, F. M. Nardini, C. Rulli, and R. Venturini. "kANNolo: Sweet and Smo…☆33Updated last week
- A lightweight, user-friendly data-plane for LLM training.☆19Updated 2 months ago
- ☆10Updated last year
- Latent Large Language Models☆18Updated 10 months ago
- ☆18Updated 3 weeks ago
- Repository related to the Dynamic Exploration Graph and its previous iterations.☆26Updated 2 weeks ago
- Port of Facebook's LLaMA model in C/C++☆22Updated last year
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 8 months ago
- Collection of datasets for benchmarking filtered vector similarity retrieval☆43Updated 3 weeks ago
- Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.☆29Updated last month
- ⚡ Faster vector search with PDX: A vertical data layout for vectors☆39Updated 2 weeks ago
- A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such a…☆148Updated this week
- ☆73Updated 5 months ago
- [VLDB 25] Maximum Inner Product is Query-Scaled Nearest Neighbor☆19Updated last month
- MPI Code Generation through Domain-Specific Language Models☆14Updated 7 months ago
- ☆34Updated last year
- ☆167Updated this week
- [MLSys 2023] Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models☆16Updated 2 years ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆131Updated last month