harrism / ranger
Generate simple index ranges in C++ and CUDA C++
☆39Updated last year
Alternatives and similar repositories for ranger:
Users that are interested in ranger are comparing it to the libraries listed below
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆105Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆83Updated 2 weeks ago
- Reusable software components for ROCm developers☆81Updated this week
- CUDA kernel author's tools☆110Updated 2 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last year
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆71Updated 9 years ago
- Distributed View Extension for Kokkos☆44Updated 2 months ago
- SYCL Conformance Tests☆67Updated last week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- DLA-Future☆69Updated this week
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 3 years ago
- Unit benchmarks of CUDA event APIs.☆17Updated 9 months ago
- Reference implementation of the draft C++ GraphBLAS specification.☆30Updated last year
- AMD’s C++ library for accelerating tensor primitives☆37Updated this week
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆49Updated this week
- ☆23Updated 3 years ago
- Header-only C++20 wrapper for MPI 4.0.☆44Updated last year
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆35Updated this week
- Autonomic Performance Environment for eXascale (APEX)☆43Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆118Updated 3 weeks ago
- SYCL Benchmark Suite☆61Updated this week
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆106Updated this week
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆45Updated 3 years ago
- Department of Energy Standard Utility Library☆30Updated 5 months ago
- Full-speed Array of Structures access☆164Updated last year
- High-level C++ for Accelerator Clusters☆144Updated 2 weeks ago
- Kernel Tuning Toolkit☆56Updated last week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 4 months ago
- SYCL Open Source Specification☆125Updated this week
- mallocMC: Memory Allocator for Many Core Architectures☆54Updated this week