ARM-software / HPCG_for_Arm
☆22Updated last year
Related projects ⓘ
Alternatives and complementary repositories for HPCG_for_Arm
- ☆16Updated 5 years ago
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 6 years ago
- ☆41Updated 4 years ago
- This package includes the implementation for Sparse-Matrix-Vector-Multiplication (SpMV) and Sparse-Matrix-Matrix-Multiplication (SpMM) fo…☆10Updated 4 years ago
- ☆58Updated last month
- development repository for the open earth compiler☆77Updated 3 years ago
- ROC_SHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆39Updated last year
- ☆17Updated 2 years ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆43Updated last month
- ☆47Updated 5 years ago
- Measure instruction latency and throughput☆22Updated 2 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆31Updated 3 years ago
- A Benchmark Suite for Heterogeneous System Computation☆52Updated 3 weeks ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆45Updated last month
- CSR-based SpGEMM on nVidia and AMD GPUs☆45Updated 8 years ago
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian…☆24Updated 4 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆33Updated 3 years ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆72Updated 8 months ago
- ☆14Updated 4 years ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆30Updated 3 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆29Updated 2 months ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆75Updated this week
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆20Updated 6 years ago
- SYCL Benchmark Suite☆56Updated 2 months ago
- Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support…☆40Updated 6 years ago
- Advanced Profiling and Analytics for AMD Hardware☆137Updated this week
- tools to create performance and roofline plots from measured data☆58Updated 10 years ago
- The University of Bristol HPC Simulation Engine☆93Updated last week
- A dynamic analysis tool to detect floating-point errors in HPC applications.☆33Updated 2 years ago
- Emulating DMA Engines on GPUs for Performance and Portability☆34Updated 9 years ago