UoB-HPC / BabelStream
STREAM, for lots of devices written in many programming models
☆334Updated 8 months ago
Alternatives and similar repositories for BabelStream:
Users that are interested in BabelStream are comparing it to the libraries listed below
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 3 months ago
- ☆239Updated this week
- Advanced Profiling and Analytics for AMD Hardware☆152Updated this week
- SYCL Open Source Specification☆134Updated last week
- Stretching GPU performance for GEMMs and tensor contractions.☆237Updated last week
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆431Updated 2 weeks ago
- Examples for HIP☆205Updated 5 months ago
- ROCm Parallel Primitives☆171Updated last week
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆219Updated this week
- RAJA Performance Portability Layer (C++)☆516Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆144Updated last week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆225Updated last month
- ☆251Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- RAJA Performance Suite☆117Updated 3 weeks ago
- An implementation of HIP that works on CPUs, across OSes.☆116Updated last year
- oneAPI Collective Communications Library (oneCCL)☆232Updated last week
- HPCToolkit performance tools: measurement and analysis components☆341Updated 2 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆328Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆336Updated this week
- ROCm BLAS marshalling library☆140Updated this week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆397Updated 3 months ago
- Next generation LAPACK implementation for ROCm platform☆100Updated this week
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated 11 months ago
- Next generation FFT implementation for ROCm☆191Updated this week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆108Updated this week
- oneAPI Level Zero Specification Headers and Loader☆259Updated last week
- The SHOC Benchmark Suite☆252Updated 3 years ago
- SYCL Benchmark Suite☆64Updated 2 months ago
- Next generation BLAS implementation for ROCm platform☆367Updated this week