zjin-lcf / HeCBench
☆233Updated this week
Alternatives and similar repositories for HeCBench:
Users that are interested in HeCBench are comparing it to the libraries listed below
- collection of benchmarks to measure basic GPU capabilities☆308Updated last month
- Advanced Profiling and Analytics for AMD Hardware☆142Updated this week
- SYCL Benchmark Suite☆64Updated last month
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆223Updated 3 weeks ago
- ☆61Updated 3 months ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆130Updated this week
- STREAM, for lots of devices written in many programming models☆330Updated 6 months ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆105Updated 7 years ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆247Updated this week
- ☆138Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- Assembler for NVIDIA Volta and Turing GPUs☆214Updated 3 years ago
- Examples for HIP☆203Updated 3 months ago
- amdgpu example code in hip/asm☆29Updated last month
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆48Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆80Updated last year
- ROC profiler library. Profiling with perf-counters and derived metrics.☆137Updated last week
- Dissecting NVIDIA GPU Architecture☆90Updated 2 years ago
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…☆140Updated this week
- RAJA Performance Suite☆118Updated this week
- TPP experimentation on MLIR for linear algebra☆121Updated last week
- Simple starter CMake project that uses NVBench.☆11Updated 3 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 2 months ago
- ☆20Updated last year
- development repository for the open earth compiler☆79Updated 4 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆80Updated 5 years ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆63Updated this week
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆462Updated last year
- Next generation LAPACK implementation for ROCm platform☆99Updated this week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆129Updated last year