avidday / hpl-cuda
simple port of hpl-2.0 to use NVIDIA GPU accelation with CUBLAS
☆28Updated 11 years ago
Alternatives and similar repositories for hpl-cuda:
Users that are interested in hpl-cuda are comparing it to the libraries listed below
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)☆89Updated 9 years ago
- The SHOC Benchmark Suite☆251Updated 3 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- MIOpenGEMM is now deprecated☆62Updated last year
- Compute applications.☆24Updated 5 years ago
- GPU Optimization and Memory Abstraction Framework☆32Updated 5 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- Examples for HIP☆204Updated 4 months ago
- HCC Sample Applications☆13Updated 8 years ago
- HIP back-end for Thrust that has been replaced by rocThrust☆28Updated last year
- High-performance, GPU-aware communication library☆85Updated 3 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆107Updated last year
- Next generation FFT implementation for ROCm☆191Updated this week
- sparse matrix pre-processing library☆81Updated 11 months ago
- Integrated Performance Monitoring for High Performance Computing☆87Updated 3 years ago
- Full-speed Array of Structures access☆169Updated last year
- ROCm OpenCL Compiler Tool Driver☆24Updated 5 years ago
- Reference implementations of MLPerf™ HPC training benchmarks☆47Updated last month
- MPI benchmark to test and measure collective performance☆50Updated 3 years ago
- GPU implementation of classical molecular dynamics proxy application.☆31Updated 8 years ago
- OpenSHMEM Application Programming Interface☆54Updated 5 months ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- RAND library for HIP programming language☆117Updated this week
- Multi-GPU Computing Benchmark Suite (CUDA)☆42Updated 7 years ago
- Chai☆43Updated last year
- CLTune: An automatic OpenCL & CUDA kernel tuner☆178Updated 2 years ago
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81Updated 2 years ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆109Updated 2 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆225Updated 2 weeks ago