rapidsai / raft
RAFT contains fundamental, widely used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for writing high-performance applications more easily.
☆972 Updated this week
Alternatives and similar repositories for raft
Users who are interested in raft compare it to the libraries listed below.
- cuVS - a library for vector search and clustering on the GPU ☆611 Updated this week
- RAPIDS Memory Manager ☆677 Updated this week
- Framework for evaluating ANNS algorithms on billion scale datasets. ☆419 Updated last month
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm ☆172 Updated 4 years ago
- ☆612 Updated 2 weeks ago
- Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search ☆1,644 Updated this week
- Knowhere is an open-source vector search engine, integrating FAISS, HNSW, etc. ☆219 Updated 2 years ago
- common in-memory tensor structure ☆1,139 Updated last month
- CUDA Kernel Benchmarking Library ☆798 Updated 2 weeks ago
- ☆204 Updated this week
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv… ☆502 Updated this week
- Graph Library for Approximate Similarity Search ☆138 Updated 4 months ago
- Vector search engine inside Milvus, integrating FAISS, HNSW, DiskANN. ☆314 Updated this week
- CUDA Core Compute Libraries ☆2,125 Updated this week
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,516 Updated this week
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,071 Updated last year
- An open-source efficient deep learning framework/compiler, written in python. ☆739 Updated 4 months ago
- A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such a… ☆176 Updated 2 weeks ago
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou… ☆502 Updated last week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆371 Updated this week
- NVIDIA Inference Xfer Library (NIXL) ☆820 Updated this week
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments. ☆833 Updated 5 months ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆411 Updated last week
- The Triton backend for the ONNX Runtime. ☆171 Updated this week
- A throughput-oriented high-performance serving framework for LLMs ☆937 Updated 2 months ago
- The core library and APIs implementing the Triton Inference Server. ☆162 Updated last week
- ☆79 Updated last year
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of… ☆189 Updated 2 months ago
- GGNN: State of the Art Graph-based GPU Nearest Neighbor Search ☆169 Updated 11 months ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster ☆859 Updated 3 months ago