UoB-HPC / BabelStream
STREAM, for lots of devices written in many programming models
☆330Updated 6 months ago
Alternatives and similar repositories for BabelStream:
Users that are interested in BabelStream are comparing it to the libraries listed below
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 2 months ago
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆223Updated 3 weeks ago
- ☆232Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated this week
- Examples for HIP☆203Updated 3 months ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆389Updated 2 months ago
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆423Updated last week
- ☆250Updated this week
- SYCL Open Source Specification☆130Updated this week
- ROCm Parallel Primitives☆170Updated this week
- Advanced Profiling and Analytics for AMD Hardware☆142Updated this week
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆326Updated 2 weeks ago
- portDNN is a library implementing neural network algorithms written using SYCL☆111Updated 10 months ago
- ROC profiler library. Profiling with perf-counters and derived metrics.☆137Updated last week
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆211Updated this week
- oneAPI Level Zero Conformance & Performance test content☆48Updated this week
- oneAPI Collective Communications Library (oneCCL)☆225Updated 2 weeks ago
- Next generation BLAS implementation for ROCm platform☆362Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- SYCL Academy, a set of learning materials for SYCL heterogeneous programming☆479Updated last month
- Next generation FFT implementation for ROCm☆188Updated this week
- RAJA Performance Suite☆118Updated this week
- Next generation SPARSE implementation for ROCm platform☆119Updated this week
- An implementation of HIP that works on CPUs, across OSes.☆115Updated last year
- ROCm BLAS marshalling library☆133Updated this week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆106Updated this week
- SYCL Benchmark Suite☆64Updated last month
- Next generation LAPACK implementation for ROCm platform☆99Updated this week
- RAJA Performance Portability Layer (C++)☆507Updated this week
- Unified Collective Communication Library☆237Updated this week