High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)
☆92Oct 22, 2015Updated 10 years ago
Alternatives and similar repositories for hpl-gpu
Users who are interested in hpl-gpu are comparing it to the repositories listed below
- Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL☆17Apr 5, 2018Updated 7 years ago
- Simple port of hpl-2.0 to use NVIDIA GPU acceleration with CUBLAS☆29May 13, 2013Updated 12 years ago
- An implementation of HPL-AI Mixed-Precision Benchmark based on hpl-2.3☆29May 30, 2021Updated 4 years ago
- Documents and source code related to a Hybrid HPL run for IU's BR2 machine☆16Nov 27, 2012Updated 13 years ago
- HCC Sample Applications☆13Jan 3, 2017Updated 9 years ago
- JIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal☆13Aug 6, 2025Updated 6 months ago
- Linpack benchmark code in C☆14Sep 4, 2012Updated 13 years ago
- GPU implementation of classical molecular dynamics proxy application.☆31Jan 30, 2017Updated 9 years ago
- An HPL-AI implementation for Fugaku☆23Jun 29, 2021Updated 4 years ago
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 6 months ago
- Heterogeneous Programming Library. Facilitates the use of accelerators on top of OpenCL☆27Oct 11, 2018Updated 7 years ago
- Global Address SPace toolbox -- Julia wrapper☆10Nov 17, 2017Updated 8 years ago
- ROCm Command Line Profiler - moved to https://github.com/GPUOpen-Tools/RCP☆10Aug 24, 2017Updated 8 years ago
- ☆12Aug 4, 2025Updated 6 months ago
- GPU-accelerated Quantum ESPRESSO using CUDA FORTRAN☆65Jan 29, 2020Updated 6 years ago
- Plug-in to accelerate Quantum ESPRESSO v5 using NVIDIA GPU☆30Feb 23, 2017Updated 9 years ago
- Unstructured mesh hydrodynamics for advanced architectures☆23Aug 16, 2023Updated 2 years ago
- ☆14Mar 21, 2019Updated 6 years ago
- Code generator for simint vectorized integrals☆29Mar 16, 2023Updated 2 years ago
- A Vectorized Implementation of the Tersoff Potential for the LAMMPS Molecular Dynamics Software☆13Nov 14, 2017Updated 8 years ago
- An implementation of ARMCI using MPI one-sided communication (RMA)☆15Oct 28, 2025Updated 4 months ago
- High-Performance Linpack Benchmark, adapted version for GPU backend☆12Sep 12, 2022Updated 3 years ago
- Modern Fortran wrappers around MPI routines☆35Dec 17, 2025Updated 2 months ago
- The repo is obsolete. Use at your own risk.☆12Aug 1, 2018Updated 7 years ago
- EPCC I/O benchmarking applications☆12Dec 15, 2021Updated 4 years ago
- HPCG benchmark based on ROCm platform☆39Updated this week
- Open source of an IBM Optimized version of the HPCG benchmark.☆17Sep 17, 2025Updated 5 months ago
- Simple example showing how to use DGMA in OpenCL☆13Feb 11, 2016Updated 10 years ago
- A neutral particle transport mini-app to study performance of sweeps on unstructured, 3D tetrahedral meshes.☆19Sep 20, 2022Updated 3 years ago
- OpenCL porting of the GROMACS molecular simulation toolkit☆27Sep 5, 2015Updated 10 years ago
- A Monte Carlo Neutron Transport Mini-App☆15Apr 15, 2019Updated 6 years ago
- ACT community resources☆26Oct 3, 2019Updated 6 years ago
- Tools to run and parse MKL verbose mode☆18Jun 28, 2022Updated 3 years ago
- ☆18Sep 17, 2025Updated 5 months ago
- Set of OpenCL microbenchmarks☆29Nov 19, 2025Updated 3 months ago
- A class to read ASCII Tecplot-Files with multiple sections☆19Jun 18, 2015Updated 10 years ago
- ☆180Jan 28, 2026Updated last month
- ROCm OpenCL Compiler Tool Driver☆24Nov 22, 2019Updated 6 years ago
- A python script that reads in a fortran 77 (.f or .F) fixed form file and converts it to a free form Fortran 90 file (.f90 or .F90).☆25Apr 12, 2016Updated 9 years ago