facebookresearch / vector_db_id_compressionLinks
Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al.
☆89Updated 3 weeks ago
Alternatives and similar repositories for vector_db_id_compression
Users that are interested in vector_db_id_compression are comparing it to the libraries listed below
Sorting:
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024☆68Updated 3 months ago
- Graph Library for Approximate Similarity Search☆140Updated 5 months ago
- ☆207Updated this week
- Memory-Bounded GPU Acceleration for Vector Search☆33Updated last month
- Simple high-throughput inference library☆155Updated 8 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆66Updated 2 years ago
- ⚡ Faster similarity search with PDX: A vertical data layout for vectors☆71Updated 3 weeks ago
- DS SERVE: The Largest Open Vector Store over Pretain Data; A Framework for Efficient and Scalable Neural Retrieval☆45Updated last week
- Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers☆153Updated last year
- A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such a…☆181Updated last month
- [SIGMOD 2024] RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search☆176Updated 8 months ago
- A lightweight, user-friendly data-plane for LLM training.☆38Updated 4 months ago
- Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini. "Efficient Inverted Indexes for Approximate Retrieva…☆104Updated last week
- Official code for "Binary embedding based retrieval at Tencent"☆44Updated last year
- Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.☆35Updated 3 weeks ago
- Bamboo-7B Large Language Model☆93Updated last year
- Collection of datasets for benchmarking filtered vector similarity retrieval☆59Updated 8 months ago
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆141Updated last year
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm☆175Updated 4 years ago
- High-performance safetensors model loader☆99Updated 3 weeks ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆181Updated 9 months ago
- Large Scale Search Index☆32Updated 2 years ago
- A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net☆43Updated this week
- Make triton easier☆50Updated last year
- Compression for Foundation Models☆35Updated 6 months ago
- state-of-the-art search over vector embeddings and structured data (SIGMOD '24)☆102Updated 11 months ago
- Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.☆122Updated last month
- LLM Serving Performance Evaluation Harness☆83Updated 11 months ago
- ☆219Updated last year
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…☆157Updated 10 months ago