UoB-HPC / BabelStream
STREAM, for lots of devices written in many programming models
☆326Updated 5 months ago
Alternatives and similar repositories for BabelStream:
Users that are interested in BabelStream are comparing it to the libraries listed below
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆217Updated this week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆263Updated last month
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆211Updated this week
- SYCL Open Source Specification☆127Updated this week
- oneAPI Collective Communications Library (oneCCL)☆222Updated 3 weeks ago
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆420Updated 2 months ago
- ☆228Updated this week
- ROCm Parallel Primitives☆170Updated this week
- Advanced Profiling and Analytics for AMD Hardware☆140Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated this week
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆209Updated 2 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆322Updated 2 weeks ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- Examples for HIP☆202Updated 2 months ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆382Updated last month
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A…☆262Updated last month
- RAJA Performance Suite☆118Updated this week
- Next generation BLAS implementation for ROCm platform☆359Updated this week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆105Updated this week
- RAND library for HIP programming language☆115Updated this week
- CUDA Kernel Benchmarking Library☆561Updated 3 months ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- ROCm BLAS marshalling library☆132Updated this week
- SYCL Benchmark Suite☆61Updated last week
- oneAPI Level Zero Specification Headers and Loader☆237Updated this week
- ☆250Updated this week
- An implementation of HIP that works on CPUs, across OSes.☆115Updated 11 months ago
- Unified Collective Communication Library☆227Updated this week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆521Updated this week
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆610Updated 3 months ago