avidday / hpl-cudaLinks

simple port of hpl-2.0 to use NVIDIA GPU accelation with CUBLAS

☆28

Alternatives and similar repositories for hpl-cuda

Users that are interested in hpl-cuda are comparing it to the libraries listed below

Sorting:

davidrohr / hpl-gpu
High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)
☆91Updated 9 years ago
AMDComputeLibraries / ComputeApps
Compute applications.
☆24Updated 5 years ago
vetter / shoc
The SHOC Benchmark Suite
☆256Updated 3 years ago
LLNL / mpiBench
MPI benchmark to test and measure collective performance
☆51Updated 4 years ago
mlcommons / hpc
Reference implementations of MLPerf™ HPC training benchmarks
☆48Updated 4 months ago
open-mpi / mtt
MPI Testing Tool
☆63Updated 6 months ago
intel / MLSL
Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…
☆108Updated 2 years ago
ROCm / HCC-Example-Application
HCC Sample Applications
☆13Updated 8 years ago
NVIDIA / CoMD-CUDA
GPU implementation of classical molecular dynamics proxy application.
☆31Updated 8 years ago
davidrohr / caldgemm
Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL
☆17Updated 7 years ago
NVlabs / SASSI
Flexible GPGPU instrumentation
☆88Updated 5 years ago
tbennun / mgbench
Multi-GPU Computing Benchmark Suite (CUDA)
☆42Updated 8 years ago
LLNL / Aluminum
High-performance, GPU-aware communication library
☆86Updated 6 months ago
Sandia-OpenSHMEM / SOS
Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …
☆72Updated 2 weeks ago
LLNL / scr
SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…
☆103Updated 3 months ago
nerscadmin / IPM
Integrated Performance Monitoring for High Performance Computing
☆89Updated 3 years ago
openshmem-org / specification
OpenSHMEM Application Programming Interface
☆58Updated 8 months ago
LLNL / LULESH
Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)
☆109Updated 2 years ago
StreamHPC / gromacs
OpenCL porting of the GROMACS molecular simulation toolkit
☆25Updated 9 years ago
sandialabs / portals4
Portals is a low-level network API for high-performance networking on high-performance computing systems developed by Sandia National Lab…
☆40Updated 10 months ago
UoB-HPC / benchmarks
Scripts for running various benchmarks on Isambard and other systems.
☆28Updated 4 years ago
ROCm / rocHPCG
HPCG benchmark based on ROCm platform
☆37Updated 2 weeks ago
linnanwang / BLASX
a heterogeneous multiGPU level-3 BLAS library
☆45Updated 5 years ago
NVIDIA-OpenACC-Course / nvidia-openacc-course-sources
Contains sources related to the lectures and labs for the NVIDIA OpenACC course.
☆51Updated 5 years ago
CNugteren / CLTune
CLTune: An automatic OpenCL & CUDA kernel tuner
☆180Updated 2 years ago
pmodels / oshmpi
OpenSHMEM Implementation on MPI
☆27Updated 3 months ago
ROCm / Thrust
HIP back-end for Thrust that has been replaced by rocThrust
☆28Updated 2 years ago
deep500 / deep500
A Deep Learning Meta-Framework and HPC Benchmarking Library
☆81Updated 3 years ago
e-ago / hpgmg-cuda-async
GPUDirect Async implementation of HPGMG-FV CUDA
☆11Updated 7 years ago
OpenACCUserGroup / openacc-users-group
☆85Updated 7 years ago