rapidsai / cuvsLinks
cuVS - a library for vector search and clustering on the GPU
☆596Updated this week
Alternatives and similar repositories for cuvs
Users that are interested in cuvs are comparing it to the libraries listed below
Sorting:
- RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-a…☆963Updated this week
- RAPIDS Memory Manager☆666Updated this week
- ☆198Updated last week
- Framework for evaluating ANNS algorithms on billion scale datasets.☆413Updated 3 weeks ago
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm☆171Updated 4 years ago
- Graph Library for Approximate Similarity Search☆136Updated 3 months ago
- Vector search engine inside Milvus, integrating FAISS, HNSW, DiskANN.☆303Updated this week
- A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such a…☆174Updated 2 months ago
- Knowhere is an open-source vector search engine, integrating FAISS, HNSW, etc.☆219Updated 2 years ago
- Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search☆1,587Updated last week
- NVIDIA Inference Xfer Library (NIXL)☆770Updated last week
- ☆78Updated 11 months ago
- KvikIO - High Performance File IO☆233Updated this week
- PyTorch Single Controller☆928Updated this week
- ☆601Updated last week
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆183Updated last month
- The core library and APIs implementing the Triton Inference Server.☆157Updated last week
- MSVBASE is a system that efficiently supports complex queries of both approximate similarity search and relational operators. It integrat…☆101Updated last year
- A library to analyze PyTorch traces.☆449Updated last week
- torchcomms: a modern PyTorch communications API☆309Updated this week
- Perplexity GPU Kernels☆539Updated last month
- CUDA Kernel Benchmarking Library☆773Updated last week
- MSCCL++: A GPU-driven communication stack for scalable AI applications☆444Updated this week
- KV cache store for distributed LLM inference☆376Updated last month
- ☆127Updated last week
- WholeGraph - large scale Graph Neural Networks☆106Updated last year
- A throughput-oriented high-performance serving framework for LLMs☆924Updated last month
- ☆72Updated 10 months ago
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆488Updated this week
- ArcticInference: vLLM plugin for high-throughput, low-latency inference☆349Updated last week