bennylp / saxpy-benchmark
SAXPY benchmark for CPU and GP-GPU
☆40Updated 7 years ago
Alternatives and similar repositories for saxpy-benchmark:
Users that are interested in saxpy-benchmark are comparing it to the libraries listed below
- portDNN is a library implementing neural network algorithms written using SYCL☆111Updated 10 months ago
- ☆29Updated 2 years ago
- Examples for using SYCL on CUDA☆62Updated last month
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 2 months ago
- SkelCL is a library providing high-level abstractions for alleviated programming of modern parallel heterogeneous systems. SkelCL is a re…☆30Updated 8 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆84Updated last year
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 3 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA☆11Updated 6 years ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆106Updated last week
- Next generation LAPACK implementation for ROCm platform☆99Updated last week
- ROCm Parallel Primitives☆171Updated last week
- High-performance, GPU-aware communication library☆85Updated 2 months ago
- RAJA Performance Suite☆118Updated this week
- a software library containing Sparse functions written in OpenCL☆174Updated 5 years ago
- Kernel Tuning Toolkit☆59Updated 2 weeks ago
- Barcelona OpenMP Task Suite is a collection of applications that allow to test OpenMP tasking implementations and compare its behaviour u…☆45Updated 5 years ago
- Next generation library for iterative sparse solvers for ROCm platform☆78Updated last week
- Reusable software components for ROCm developers☆83Updated last week
- DLA-Future☆70Updated last week
- Autonomic Performance Environment for eXascale (APEX)☆45Updated this week
- C++ Library for Portable SIMD Vectorization☆83Updated 4 months ago
- Exercises and Solutions for "Programming Your GPU with OpenMP: A Hands-On Introduction"☆133Updated this week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- Next generation SPARSE implementation for ROCm platform☆119Updated this week
- A streamlined CMake build system foundation for developing HPC software☆269Updated 3 weeks ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆68Updated 2 years ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- RAND library for HIP programming language☆117Updated last week