High-Performance Linpack Benchmark adopted version for GPU backend
☆12Sep 12, 2022Updated 3 years ago
Alternatives and similar repositories for HPL_GPU
Users that are interested in HPL_GPU are comparing it to the libraries listed below
Sorting:
- ext_mpi_collectives☆11Apr 1, 2025Updated 11 months ago
- GPUDirect Async implementation of HPGMG-FV CUDA☆11May 11, 2018Updated 7 years ago
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆18Jul 9, 2025Updated 7 months ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆67Dec 10, 2025Updated 2 months ago
- ☆17Nov 11, 2025Updated 3 months ago
- Repo for climate deep learning codes☆16May 21, 2019Updated 6 years ago
- OpenCL porting of the GROMACS molecular simulation toolkit☆27Sep 5, 2015Updated 10 years ago
- An HPL-AI implementation for Fugaku☆23Jun 29, 2021Updated 4 years ago
- A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated 11 months ago
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner☆21Sep 12, 2025Updated 5 months ago
- NAS Parallel Benchmarks for evaluating GPU and APIs☆29Sep 29, 2025Updated 5 months ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆25Updated this week
- MAD (Model Automation and Dashboarding)☆31Updated this week
- GPU Performance Advisor☆66Jul 25, 2022Updated 3 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆29Updated this week
- library for measuring communication in distributed-memory parallel applications that use the standard Message-Passing Interface (MPI)☆22Sep 17, 2025Updated 5 months ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆34Mar 19, 2023Updated 2 years ago
- ROCm Machine Learning and HPC Stack installer