rapidsai / rmmLinks

RAPIDS Memory Manager

☆589

Alternatives and similar repositories for rmm

Users that are interested in rmm are comparing it to the libraries listed below

Sorting:

NVIDIA / cuCollections
☆543Updated last week
NVIDIA / nvbench
CUDA Kernel Benchmarking Library
☆670Updated this week
NVIDIA / NVTX
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…
☆402Updated last month
NVIDIA / jitify
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
☆545Updated last week
NVIDIA / multi-gpu-programming-models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
☆743Updated 4 months ago
rapidsai / raft
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-a…
☆903Updated this week
nv-legate / legate
The Foundation for All Legate Libraries
☆218Updated last week
NVIDIA / cccl
CUDA Core Compute Libraries
☆1,711Updated this week
NVIDIA / cub
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
☆1,755Updated last year
NVIDIA / gdrcopy
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
☆1,138Updated 3 weeks ago
NVIDIA / Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
☆339Updated this week
dmlc / dlpack
common in-memory tensor structure
☆1,019Updated 2 weeks ago
rapidsai / cuvs
cuVS - a library for vector search and clustering on the GPU
☆449Updated this week
NVIDIA / MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
☆1,332Updated this week
sleeepyjack / warpcore
A Library for fast Hash Tables on GPUs
☆124Updated 2 years ago
eyalroz / cuda-api-wrappers
Thin, unified, C++-flavored wrappers for the CUDA APIs
☆844Updated last week
rapidsai / kvikio
KvikIO - High Performance File IO
☆213Updated this week
NVIDIA / nvcomp
Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade…
☆591Updated 9 months ago
ROCm / rccl
ROCm Communication Collectives Library (RCCL)
☆343Updated this week
uxlfoundation / oneCCL
oneAPI Collective Communications Library (oneCCL)
☆237Updated 2 weeks ago
NVIDIA / nsight-training
Training material for Nsight developer tools
☆159Updated 10 months ago
microsoft / mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
☆379Updated this week
moderngpu / moderngpu
Patterns and behaviors for GPU computing
☆1,725Updated 3 years ago
openucx / ucc
Unified Collective Communication Library
☆256Updated last week
pytorch / tensorpipe
A tensor-aware point-to-point communication primitive for machine learning
☆258Updated 2 years ago
ROCm / composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
☆427Updated this week
ekondis / mixbench
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
☆404Updated 5 months ago
rapidsai / ucx-py
Python bindings for UCX
☆137Updated this week
NVIDIA-Merlin / HierarchicalKV
HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…
☆152Updated last week
KernelTuner / kernel_tuner
Kernel Tuner
☆345Updated last week