zjin-lcf / HeCBench
☆236Updated this week
Alternatives and similar repositories for HeCBench:
Users that are interested in HeCBench are comparing it to the libraries listed below
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆225Updated last week
- STREAM, for lots of devices written in many programming models☆332Updated 7 months ago
- Advanced Profiling and Analytics for AMD Hardware☆145Updated this week
- collection of benchmarks to measure basic GPU capabilities☆354Updated 2 months ago
- ☆61Updated 3 months ago
- SYCL Benchmark Suite☆64Updated last month
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆82Updated last week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 3 months ago
- ☆20Updated 2 years ago
- Assembler for NVIDIA Volta and Turing GPUs☆216Updated 3 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆134Updated this week
- RAJA Performance Suite☆119Updated this week
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi☆102Updated 10 months ago
- ☆141Updated this week
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A…☆269Updated 2 weeks ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆51Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆141Updated this week
- amdgpu example code in hip/asm☆29Updated 2 months ago
- ROCm Parallel Primitives☆171Updated this week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆253Updated 3 weeks ago
- ☆43Updated 4 years ago
- 14 basic topics for VEGA64 performance optmization☆54Updated 4 years ago
- Next generation SPARSE implementation for ROCm platform☆119Updated this week
- Dissecting NVIDIA GPU Architecture☆90Updated 2 years ago
- SYCL Open Source Specification☆134Updated last week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- Stretching GPU performance for GEMMs and tensor contractions.☆235Updated this week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆106Updated this week
- development repository for the open earth compiler☆79Updated 4 years ago