avidday / hpl-cuda
simple port of hpl-2.0 to use NVIDIA GPU accelation with CUBLAS
☆28Updated 11 years ago
Alternatives and similar repositories for hpl-cuda:
Users that are interested in hpl-cuda are comparing it to the libraries listed below
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)☆89Updated 9 years ago
- Compute applications.☆24Updated 5 years ago
- The SHOC Benchmark Suite☆250Updated 3 years ago
- GPU Optimization and Memory Abstraction Framework☆32Updated 5 years ago
- OpenSHMEM Application Programming Interface☆54Updated 4 months ago
- MIOpenGEMM is now deprecated☆62Updated last year
- GPU implementation of classical molecular dynamics proxy application.☆31Updated 8 years ago
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆102Updated last week
- ROCm OpenCL Compiler Tool Driver☆24Updated 5 years ago
- Set of OpenCL microbenchmarks☆29Updated last year
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆49Updated last year
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆67Updated last week
- CLTune: An automatic OpenCL & CUDA kernel tuner☆177Updated 2 years ago
- Integrated Performance Monitoring for High Performance Computing☆88Updated 3 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA☆11Updated 6 years ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆78Updated last year
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3)☆43Updated 7 years ago
- Tools for parsing, assembling, and disassembling HSAIL.☆71Updated 4 years ago
- HCC Sample Applications☆13Updated 8 years ago
- MPI benchmark to test and measure collective performance☆50Updated 3 years ago
- ☆75Updated last year
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆40Updated last year
- Flexible GPGPU instrumentation☆86Updated 5 years ago
- sparse matrix pre-processing library☆82Updated 10 months ago
- Official BOLT Repository☆28Updated 7 months ago
- Python interface for the LIKWID C API (https://github.com/RRZE-HPC/likwid)☆45Updated 2 years ago
- Source code from NVIDIA CUDACasts☆49Updated 10 years ago
- ROCm Driver RDMA Peer to Peer Support☆20Updated 6 years ago
- A Multi-purpose, Application-Centric, Scalable I/O Proxy Application☆34Updated 4 years ago
- HIP back-end for Thrust that has been replaced by rocThrust☆28Updated last year