avidday / hpl-cudaLinks
simple port of hpl-2.0 to use NVIDIA GPU accelation with CUBLAS
☆28Updated 12 years ago
Alternatives and similar repositories for hpl-cuda
Users that are interested in hpl-cuda are comparing it to the libraries listed below
Sorting:
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)☆91Updated 9 years ago
- Compute applications.☆24Updated 5 years ago
- The SHOC Benchmark Suite☆256Updated 3 years ago
- MPI benchmark to test and measure collective performance☆51Updated 4 years ago
- Reference implementations of MLPerf™ HPC training benchmarks☆48Updated 4 months ago
- MPI Testing Tool☆63Updated 6 months ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆108Updated 2 years ago
- HCC Sample Applications☆13Updated 8 years ago
- GPU implementation of classical molecular dynamics proxy application.☆31Updated 8 years ago
- Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL☆17Updated 7 years ago
- Flexible GPGPU instrumentation☆88Updated 5 years ago
- Multi-GPU Computing Benchmark Suite (CUDA)☆42Updated 8 years ago
- High-performance, GPU-aware communication library☆86Updated 6 months ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆72Updated 2 weeks ago
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆103Updated 3 months ago
- Integrated Performance Monitoring for High Performance Computing☆89Updated 3 years ago
- OpenSHMEM Application Programming Interface☆58Updated 8 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆109Updated 2 years ago
- OpenCL porting of the GROMACS molecular simulation toolkit☆25Updated 9 years ago
- Portals is a low-level network API for high-performance networking on high-performance computing systems developed by Sandia National Lab…☆40Updated 10 months ago
- Scripts for running various benchmarks on Isambard and other systems.☆28Updated 4 years ago
- HPCG benchmark based on ROCm platform☆37Updated 2 weeks ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course.☆51Updated 5 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆180Updated 2 years ago
- OpenSHMEM Implementation on MPI☆27Updated 3 months ago
- HIP back-end for Thrust that has been replaced by rocThrust☆28Updated 2 years ago
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81Updated 3 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA☆11Updated 7 years ago
- ☆85Updated 7 years ago