harrism / ranger
Generate simple index ranges in C++ and CUDA C++
☆39 Updated last year
Alternatives and similar repositories for ranger:
Users who are interested in ranger are comparing it to the libraries listed below.
- Reusable software components for ROCm developers ☆81 Updated this week
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆104 Updated this week
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA ☆83 Updated this week
- A library to benchmark CUDA code, similar to Google Benchmark. ☆28 Updated 3 years ago
- SYCL Conformance Tests ☆65 Updated last week
- ☆23 Updated 2 years ago
- CUDA kernel author's tools ☆110 Updated 2 years ago
- Distributed View Extension for Kokkos ☆43 Updated last month
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆93 Updated 3 years ago
- AMD’s C++ library for accelerating tensor primitives ☆38 Updated this week
- A simple profiler to count NVIDIA PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆49 Updated last year
- DLA-Future ☆69 Updated this week
- Advanced Profiling and Analytics for AMD Hardware ☆139 Updated this week
- BGHT: High-performance static GPU hash tables. ☆57 Updated 4 months ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆46 Updated 3 months ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆35 Updated 4 months ago
- SYCL Benchmark Suite ☆60 Updated 4 months ago
- 🎃 GPU load-balancing library for regular and irregular computations. ☆59 Updated 7 months ago
- Subset of BLAS routines optimized for NVIDIA GPUs ☆67 Updated last year
- RAJA Performance Suite ☆118 Updated this week
- Header-only C++20 wrapper for MPI 4.0. ☆44 Updated last year
- Distributed ranges is a generalization of C++ ranges for distributed data structures. ☆48 Updated this week
- High-performance, GPU-aware communication library ☆84 Updated 3 weeks ago
- SYCL Reference Manual ☆27 Updated 9 months ago
- Autonomic Performance Environment for eXascale (APEX) ☆42 Updated 2 weeks ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆116 Updated last week
- hipFFT is an FFT marshalling library. ☆57 Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators ☆73 Updated last year
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆104 Updated 7 years ago
- Kernel Tuning Toolkit ☆56 Updated 2 months ago