trxcllnt / rapids-compose
☆26 Updated last year
Alternatives and similar repositories for rapids-compose
Users who are interested in rapids-compose are comparing it to the libraries listed below.
- a repo for how to set up xubuntu like me ☆29 Updated 2 years ago
- RAPIDS Memory Manager ☆663 Updated this week
- ☆597 Updated this week
- ☆42 Updated this week
- RAPIDS GPU-BDB ☆108 Updated last year
- ☆21 Updated this week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆566 Updated 2 months ago
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template… ☆366 Updated last year
- Abstraction Library for Parallel Kernel Acceleration ☆396 Updated this week
- ☆19 Updated 6 years ago
- KvikIO - High Performance File IO ☆233 Updated last week
- Distributed ranges is a generalization of C++ ranges for distributed data structures. ☆52 Updated 2 months ago
- Morpheus Runtime Core (MRC) ☆50 Updated last month
- STREAM, for lots of devices written in many programming models ☆352 Updated 3 months ago
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020. ☆22 Updated 5 years ago
- Thin, unified, C++-flavored wrappers for the CUDA APIs ☆863 Updated 3 weeks ago
- The Foundation for All Legate Libraries ☆232 Updated this week
- Caliper is an instrumentation and performance profiling library ☆393 Updated 3 weeks ago
- The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information. ☆229 Updated this week
- Full-speed Array of Structures access ☆176 Updated 2 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆260 Updated 10 months ago
- CUDA Kernel Benchmarking Library ☆772 Updated 2 weeks ago
- Generate simple index ranges in C++ and CUDA C++ ☆39 Updated 2 years ago
- Analyze graph/hierarchical performance data using pandas dataframes ☆116 Updated last month
- ☆26 Updated this week
- CUDA kernel author's tools ☆114 Updated 3 years ago
- RAJA Performance Portability Layer (C++) ☆551 Updated last week
- Python SYCL bindings and SYCL-based Python Array API library ☆118 Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆95 Updated 3 years ago
- A Library for fast Hash Tables on GPUs ☆127 Updated last month