High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)
☆92Oct 22, 2015Updated 10 years ago
Alternatives and similar repositories for hpl-gpu
Users that are interested in hpl-gpu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL☆17Apr 5, 2018Updated 8 years ago
- simple port of hpl-2.0 to use NVIDIA GPU accelation with CUBLAS☆29May 13, 2013Updated 12 years ago
- Documents and source code related to a Hybrid HPL run for IU's BR2 machine☆16Nov 27, 2012Updated 13 years ago
- Heterogeneous Programming Library. Facilitates the use of accelerators on top of OpenCL☆27Oct 11, 2018Updated 7 years ago
- JIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal☆13Aug 6, 2025Updated 8 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- GPU implementation of classical molecular dynamics proxy application.☆31Jan 30, 2017Updated 9 years ago
- The repo is obsolete. Use at your own risk.☆12Aug 1, 2018Updated 7 years ago
- ROCm Command Line Profiler - Updated moved to https://github.com/GPUOpen-Tools/RCP☆10Aug 24, 2017Updated 8 years ago
- Unstructured mesh hydrodynamics for advanced architectures☆23Aug 16, 2023Updated 2 years ago
- Plug-in to accelerate Quantum ESPRESSO v5 using NVIDIA GPU☆31Feb 23, 2017Updated 9 years ago
- GPU-accelerated Quantum ESPRESSO using CUDA FORTRAN☆66Jan 29, 2020Updated 6 years ago
- Simple example showing how to use DGMA in OpenCL☆13Feb 11, 2016Updated 10 years ago
- A Vectorized Implementation of the Tersoff Potential for the LAMMPS Molecular Dynamics Software☆13Nov 14, 2017Updated 8 years ago
- An implementation of ARMCI using MPI one-sided communication (RMA)☆17Oct 28, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 8 months ago
- Code generator for simint vectorized integrals☆29Mar 16, 2023Updated 3 years ago
- An HPL-AI implementation for Fugaku☆23Jun 29, 2021Updated 4 years ago
- Integrated Performance Monitoring for High Performance Computing☆92Nov 5, 2021Updated 4 years ago
- Modern Fortran wrappers around MPI routines☆36Dec 17, 2025Updated 4 months ago
- ☆11Aug 8, 2021Updated 4 years ago
- Traffic Prediction in PaddlePaddle (ASC17 Deep Learning Application)☆17Apr 27, 2017Updated 9 years ago
- ROCm OpenCL Compiler Tool Driver☆24Nov 22, 2019Updated 6 years ago
- Data Accelerator: Creates a burst buffer from generic hardware and integrates it with Slurm https://www.hpc.cam.ac.uk/research/data-acc h…☆18Mar 30, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Linux kernel repository merging linux-linaro-stable and freescale mx6 patchsets☆30May 22, 2015Updated 10 years ago
- Connect to a LSF main node directly or trough a ssh jump node, launch a jupyter notebook via bsub and open automatically a tunnel. The n…☆20Oct 27, 2021Updated 4 years ago
- ☆184Mar 29, 2026Updated last month
- (Deprecated) hipCaffe: the HIP port of Caffe☆124May 1, 2024Updated 2 years ago
- HPCG benchmark based on ROCm platform☆41Updated this week
- Official HPCG benchmark source code☆343Jul 5, 2024Updated last year
- Comb is a communication performance benchmarking tool.☆26Feb 27, 2023Updated 3 years ago
- MIOpenGEMM is now deprecated☆61Jul 17, 2023Updated 2 years ago
- OpenCL porting of the GROMACS molecular simulation toolkit☆27Sep 5, 2015Updated 10 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A neutral particle transport mini-app to study performance of sweeps on unstructured, 3D tetrahedral meshes.☆19Sep 20, 2022Updated 3 years ago
- Experimental benchmark and test toolkit for optimized ARM memcpy/memset functions in the Linux kernel☆14Aug 28, 2013Updated 12 years ago
- GPU Eigensolver for symmetric/hermitian matrices.☆69Oct 25, 2021Updated 4 years ago
- ☆12Aug 4, 2025Updated 8 months ago
- ☆18Sep 17, 2025Updated 7 months ago
- A sparse BLAS lib supporting multiple backends☆51Mar 18, 2026Updated last month
- Open-source tools for Geekbench☆12Feb 3, 2026Updated 3 months ago