avidday / hpl-cuda
simple port of hpl-2.0 to use NVIDIA GPU accelation with CUBLAS
☆28Updated 11 years ago
Related projects ⓘ
Alternatives and complementary repositories for hpl-cuda
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)☆88Updated 9 years ago
- HCC Sample Applications☆13Updated 7 years ago
- HIP back-end for Thrust that has been replaced by rocThrust☆28Updated last year
- ROCm OpenCL Compiler Tool Driver☆24Updated 5 years ago
- Compute applications.☆25Updated 4 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆99Updated 7 years ago
- MPI benchmark to test and measure collective performance☆49Updated 3 years ago
- The SHOC Benchmark Suite☆247Updated 2 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 7 years ago
- Documents and source code related to a Hybrid HPL run for IU's BR2 machine☆15Updated 11 years ago
- Flexible GPGPU instrumentation☆86Updated 5 years ago
- OpenSHMEM Application Programming Interface☆51Updated last week
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 3 years ago
- RAND library for HIP programming language☆111Updated this week
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆72Updated 8 months ago
- GPU implementation of classical molecular dynamics proxy application.☆30Updated 7 years ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆63Updated last week
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 4 years ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆109Updated last year
- High-performance, GPU-aware communication library☆84Updated last month
- MIOpenGEMM is now deprecated☆61Updated last year
- Emulating DMA Engines on GPUs for Performance and Portability☆34Updated 9 years ago
- Examples for HIP☆200Updated 2 weeks ago
- sparse matrix pre-processing library☆81Updated 6 months ago
- The OpenDwarfs project provides a benchmark suite consisting of different computation/communication idioms, i.e., dwarfs, for state-of-ar…☆96Updated 5 years ago
- Next generation FFT implementation for ROCm☆176Updated this week
- CUDA Tensor Transpose (cuTT) library☆50Updated 7 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆78Updated 5 years ago
- ROCm Parallel Primitives☆162Updated this week
- Multi-GPU Computing Benchmark Suite (CUDA)☆42Updated 7 years ago