rapidsai / rmm
RAPIDS Memory Manager
☆572 · Updated last week
Alternatives and similar repositories for rmm:
Users who are interested in rmm are comparing it to the libraries listed below:
- ☆537 · Updated this week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou… ☆376 · Updated 2 weeks ago
- CUDA Kernel Benchmarking Library ☆621 · Updated this week
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster ☆689 · Updated 2 months ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆533 · Updated last month
- cuVS - a library for vector search and clustering on the GPU ☆382 · Updated this week
- RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-a… ☆870 · Updated this week
- CUDA Core Compute Libraries ☆1,610 · Updated this week
- The Foundation for All Legate Libraries ☆213 · Updated this week
- oneAPI Collective Communications Library (oneCCL) ☆232 · Updated 3 weeks ago
- STREAM, for lots of devices written in many programming models ☆333 · Updated 7 months ago
- Unified Collective Communication Library ☆248 · Updated this week
- Kernel Tuner ☆328 · Updated last week
- common in-memory tensor structure ☆982 · Updated 2 weeks ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl ☆1,745 · Updated last year
- Patterns and behaviors for GPU computing ☆1,712 · Updated 2 years ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆318 · Updated this week
- KvikIO - High Performance File IO ☆206 · Updated this week
- ROCm Communication Collectives Library (RCCL) ☆326 · Updated this week
- ☆251 · Updated this week
- Library for specialized dense and sparse matrix operations, and deep learning primitives. ☆868 · Updated last week
- Training material for Nsight developer tools ☆156 · Updated 8 months ago
- Thin, unified, C++-flavored wrappers for the CUDA APIs ☆837 · Updated this week
- AMD's graph optimization engine. ☆215 · Updated this week
- An efficient C++17 GPU numerical computing library with Python-like syntax ☆1,313 · Updated this week
- RAJA Performance Portability Layer (C++) ☆513 · Updated this week
- oneAPI Math Library (oneMath) ☆667 · Updated last week
- ☆36 · Updated this week
- A Library for fast Hash Tables on GPUs ☆115 · Updated 2 years ago
- A Toolkit for Programming Parallel Algorithms on Shared-Memory Multicore Machines ☆357 · Updated 4 months ago