avidday / hpl-cuda
Simple port of hpl-2.0 to use NVIDIA GPU acceleration with CUBLAS
☆29 Updated 12 years ago
Alternatives and similar repositories for hpl-cuda
Users who are interested in hpl-cuda are comparing it to the libraries listed below.
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL) ☆92 Updated 10 years ago
- Compute applications. ☆25 Updated 6 years ago
- HCC Sample Applications ☆13 Updated 8 years ago
- The SHOC Benchmark Suite ☆259 Updated 2 months ago
- Multi-GPU Computing Benchmark Suite (CUDA) ☆43 Updated 8 years ago
- MPI benchmark to test and measure collective performance ☆52 Updated 4 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA ☆11 Updated 7 years ago
- A Deep Learning Meta-Framework and HPC Benchmarking Library ☆82 Updated 3 years ago
- Flexible GPGPU instrumentation ☆89 Updated 6 years ago
- A Benchmark Suite for Heterogeneous System Computation ☆54 Updated 9 months ago
- GPU implementation of classical molecular dynamics proxy application. ☆31 Updated 8 years ago
- Set of OpenCL microbenchmarks ☆29 Updated 3 weeks ago
- Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL ☆17 Updated 7 years ago
- Examples for HIP ☆213 Updated last year
- Reference implementations of MLPerf™ HPC training benchmarks ☆49 Updated 9 months ago
- ☆18 Updated last year
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric … ☆75 Updated 3 months ago
- A library to benchmark CUDA code, similar to google benchmark. ☆30 Updated 4 years ago
- STREAM, for lots of devices written in many programming models ☆352 Updated 3 months ago
- Sparse matrix computation library for GPU ☆58 Updated 5 years ago
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability… ☆103 Updated last month
- A unified framework across multiple programming platforms ☆42 Updated 6 months ago
- OpenSHMEM Application Programming Interface ☆61 Updated last year
- High-performance, GPU-aware communication library ☆86 Updated 11 months ago
- ☆48 Updated 5 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner ☆182 Updated 3 years ago
- sparse matrix pre-processing library ☆83 Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆198 Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆130 Updated this week
- Integrated Performance Monitoring for High Performance Computing ☆90 Updated 4 years ago