rapidsai / cuvsLinks
cuVS - a library for vector search and clustering on the GPU
☆562Updated this week
Alternatives and similar repositories for cuvs
Users that are interested in cuvs are comparing it to the libraries listed below
Sorting:
- RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-a…☆951Updated this week
- RAPIDS Memory Manager☆655Updated this week
- ☆194Updated this week
- Framework for evaluating ANNS algorithms on billion scale datasets.☆409Updated 2 weeks ago
- Graph Library for Approximate Similarity Search☆132Updated 2 months ago
- KvikIO - High Performance File IO☆231Updated this week
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm☆170Updated 4 years ago
- Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search☆1,543Updated this week
- Perplexity GPU Kernels☆519Updated 2 weeks ago
- NVIDIA Inference Xfer Library (NIXL)☆712Updated this week
- A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such a…☆171Updated last month
- ☆77Updated 10 months ago
- ☆596Updated 2 weeks ago
- WholeGraph - large scale Graph Neural Networks☆105Updated 11 months ago
- MSCCL++: A GPU-driven communication stack for scalable AI applications☆433Updated this week
- Knowhere is an open-source vector search engine, integrating FAISS, HNSW, etc.☆218Updated 2 years ago
- Vector search engine inside Milvus, integrating FAISS, HNSW, DiskANN.☆293Updated last week
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆175Updated last week
- KV cache store for distributed LLM inference☆358Updated 2 months ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆585Updated this week
- PyTorch Single Controller☆869Updated this week
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆446Updated this week
- torchcomms: a modern PyTorch communications API☆245Updated this week
- CUDA Kernel Benchmarking Library☆762Updated 3 weeks ago
- A throughput-oriented high-performance serving framework for LLMs☆912Updated last week
- kernels, of the mega variety☆597Updated last month
- Efficient and easy multi-instance LLM serving☆506Updated 2 months ago
- Evaluating Large Language Models for CUDA Code Generation ComputeEval is a framework designed to generate and evaluate CUDA code from Lar…☆72Updated last month
- A library to analyze PyTorch traces.☆426Updated this week
- ArcticInference: vLLM plugin for high-throughput, low-latency inference☆292Updated last week