UoB-HPC / BabelStream
STREAM, for lots of devices written in many programming models
☆332 Updated 7 months ago
Alternatives and similar repositories for BabelStream:
Users who are interested in BabelStream are comparing it to the repositories listed below.
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆261 Updated 3 months ago
- Advanced Profiling and Analytics for AMD Hardware ☆145 Updated this week
- SYCL Open Source Specification ☆134 Updated this week
- SYCL Benchmark Suite ☆64 Updated last month
- ☆236 Updated last week
- oneAPI Level Zero Conformance & Performance test content ☆48 Updated this week
- Examples for HIP ☆204 Updated 4 months ago
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi… ☆225 Updated last week
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆106 Updated this week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆104 Updated 7 years ago
- This is a set of simple programs that can be used to explore the features of a parallel platform. ☆426 Updated 2 weeks ago
- SYCL Academy, a set of learning materials for SYCL heterogeneous programming ☆481 Updated 2 months ago
- ROCm Parallel Primitives ☆171 Updated this week
- Stretching GPU performance for GEMMs and tensor contractions. ☆235 Updated this week
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas… ☆217 Updated this week
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A… ☆269 Updated 2 weeks ago
- portDNN is a library implementing neural network algorithms written using SYCL ☆113 Updated 10 months ago
- Next generation SPARSE implementation for ROCm platform ☆119 Updated this week
- RAND library for HIP programming language ☆117 Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆93 Updated 3 years ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform. ☆71 Updated this week
- Reusable software components for ROCm developers ☆83 Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics. ☆141 Updated this week
- oneAPI Collective Communications Library (oneCCL) ☆232 Updated last week
- SYCL Conformance Tests ☆69 Updated this week
- ☆250 Updated last week
- Next generation LAPACK implementation for ROCm platform ☆99 Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆51 Updated this week
- ROCm Device Libraries ☆97 Updated 11 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications ☆327 Updated this week