avidday / hpl-cudaLinks
simple port of hpl-2.0 to use NVIDIA GPU accelation with CUBLAS
☆29Updated 12 years ago
Alternatives and similar repositories for hpl-cuda
Users that are interested in hpl-cuda are comparing it to the libraries listed below
Sorting:
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)☆92Updated 10 years ago
- Compute applications.☆25Updated 6 years ago
- HCC Sample Applications☆13Updated 9 years ago
- The SHOC Benchmark Suite☆259Updated 3 months ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆183Updated 3 years ago
- GPU implementation of classical molecular dynamics proxy application.☆31Updated 8 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆198Updated last week
- GPUDirect Async implementation of HPGMG-FV CUDA☆11Updated 7 years ago
- Examples for HIP☆213Updated last year
- OpenCL porting of the GROMACS molecular simulation toolkit☆27Updated 10 years ago
- Set of OpenCL microbenchmarks☆29Updated last month
- Multi-GPU Computing Benchmark Suite (CUDA)☆43Updated 8 years ago
- Flexible GPGPU instrumentation☆89Updated 6 years ago
- OpenSHMEM Application Programming Interface☆62Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆130Updated 2 weeks ago
- ROCm Device Libraries☆96Updated last year
- ☆74Updated 2 years ago
- Reference implementations of MLPerf™ HPC training benchmarks☆49Updated 10 months ago
- High-performance, GPU-aware communication library☆86Updated 3 weeks ago
- Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL☆17Updated 7 years ago
- MPI benchmark to test and measure collective performance☆52Updated 4 years ago
- ROCm OpenCL Compiler Tool Driver☆24Updated 6 years ago
- a tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester☆36Updated 2 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆109Updated 8 years ago
- A unified framework across multiple programming platforms☆42Updated 7 months ago
- ☆55Updated 2 years ago
- A Benchmark Suite for Heterogeneous System Computation☆55Updated 10 months ago
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆103Updated 2 months ago
- HPCG benchmark based on ROCm platform☆38Updated 2 months ago
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated last year