rapidsai / rapids-cmake
☆28Updated last week
Related projects:
- Generate simple index ranges in C++ and CUDA C++☆38Updated last year
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆46Updated this week
- ☆18Updated this week
- pika builds on C++ std::execution with fiber, CUDA, HIP, and MPI support.☆62Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆90Updated 2 years ago
- A Low-Level Abstraction of Memory Access☆79Updated 6 months ago
- DLA-Future☆63Updated this week
- Unit benchmarks of CUDA event APIs.☆17Updated 4 months ago
- High-performance, GPU-aware communication library☆85Updated last month
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template…☆350Updated last month
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆100Updated this week
- Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SYCL - all it takes to sum a lot of numbers fast!☆73Updated 4 months ago
- Parallel selection on GPUs☆14Updated 3 years ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆43Updated 2 years ago
- A task benchmark☆39Updated last month
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆104Updated this week
- ☆18Updated this week
- SYCL Benchmark Suite☆57Updated last week
- ☆68Updated 4 years ago
- ☆27Updated 9 months ago
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA☆78Updated last month
- A library to benchmark CUDA code, similar to google benchmark.☆27Updated 3 years ago
- ☆31Updated this week
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.☆22Updated 3 years ago
- Parallel Tasking Library (PTL) - Lightweight C++11 multithreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆40Updated last month
- Reusable software components for ROCm developers☆81Updated last week
- CUDA kernel author's tools☆105Updated 2 years ago
- SYCL Open Source Specification☆109Updated this week
- SYCL Reference Manual☆25Updated 4 months ago
- Header-only C++20 wrapper for MPI 4.0.☆43Updated 10 months ago